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DEDICATION a 


As this volume of the Annual Review of Sociology was going to press, we 
learned that Judith Blake had died. Dr. Blake was an outstanding figure in the 
field of demography. We were most fortunate to have had her services as 
editor of the Annual Review of Sociology for two years, and as its associate 
editor for four years before that. Dr. Blake was committed to producing 
volumes of distinction. She continued to work valiantly during her long 
struggle with emphysema and recently concluded the preparation of Volume 
19. We dedicate this volume to her memory. 

Judith Blake had a distinguished career as a demographer, focussing es- 
pecially on family size and fertility. She received her BS and PhD from 
Columbia University. After teaching at the University of California at Berke- 
ley from 1957 to 1976, Dr. Blake became the Fred H. Bixby Professor of 
Population Policy in the School of Public Health at the University of Califor- 
nia at Los Angeles, a position she held until her recent retirement. She was 
also at different times the President of the Population Association of America 
and of the Sociological Research Association. 

Dr. Blake was the author of Family Size and Achievement, which received 
the William J. Goode Book Award in 1989 from the American Sociological 
Association. She was also the author of Family Structure in Jamaica: The 
Social Context of Reproduction, and numerous other publications. Dr. Blake 
was a fellow of the American Academy of Arts and Sciences and of the 
American Association for the Advancement of Science. 

Judith Blake will be remembered by her colleagues for her lively in- 
telligence and her skilled professionalism. We dedicate this volume to her 
with gratitude and in the hope that it is a fitting tribute to one part of her 
admirably productive professional life. 


The Editors 


PREFACE 


Our prefatory chapter by Lewis Coser contributes a European perspective on 
graduate education in the United States. Coser’s experience at Columbia was 
at the height of that institution’s powers—its faculty numbered Lynd, Mills, 
Lagersfeld, Merton, Davis, Lipset, and Hyman. Coser formed a close career- 
long relationship with Merton and benefitted in the production of The Func- 
tions of Social Conflict, his first and very successful book. Unlike the young 
New York intellectuals who predominated at Columbia, Coser saw the depart- 
ment through the eyes of one trained in European sociology, who had 
experienced the catastrophe of Hitler. He thus brings a unique vantage to that 
period of sociology’s history. 

Coser’s later career spanned years at the University of Chicago and at 
Brandeis University. His involvement in the publication of Dissent helped to 
create a journal still widely read and respected 40 years later. Coser was 
President of the American Sociological Association and the Society for the 
Study of Social Problems, and in many other ways served the profession witn 
distinction. 

Volume 19 as a whole 1s one of our most diverse, ranging from attention to 
research methods through chapters on the sociology of literature, organiza- 
tions, and risk, change in US medical care, and refugees and immigrants. 
These articles have little in common but they provide an unusually broad 
sweep of contemporary sociology. 

Finally we mourn the passing of Roberta Simmons, our close colleague on 
the editorial board (1986-1990), whom we have lost to cancer. 


Judith Blake 
Editor 
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Abstract 


This paper depicts the career of the author as a complicated journey from 
European beginnings to American destinations. It emphasizes crucial turning 
points that finally led the author from antifascist radical intellectual to 
established member of the American sociological community. It attempts to 
show lines of divergence but also lines of continuity in his thought and 
contributions. This essay attempts to show the extent to which his background 
influenced the themes and orientations of his sociological work. 

The paper may be read as a contribution to the sociology of knowledge 
from the autobiographical point of view. It stresses the various stages of the 
author’s career and attempts to show the extent to which these stages are 
reflected in his writings and general orientations. The paper closes with a 
bird’s eye view of the recent past and the probable future of American 
sociological thought. 


INTRODUCTION 


Throughout my career I have been something of an outsider professionally, 
but I also often have had the advantage of participating in the inside life of 
sociology. I have been actively engaged in our discipline yet I have never 
been totally committed to any of the various approaches that have dominated 
our field in my many years as a sociologist. 


0360-0572/93/08 15-0001S02.00 


2 COSER 
Beginnings 


My beginnings foreshadowed my subsequent career. I was born in Berlin, 
Germany in 1913 and left high school shortly before the Nazis came to power 
in 1933. I had been active in various leftist groupings in my last years in 
Berlin and I was Jewish. Under these circumstances, I was told that I might 
soon fall into the clutches of the Nazi police or of the storm troopers. I left 
Germany early in 1933 to live in Paris, while my parents, who had no political 
involvement, stayed on, hoping like so many members of the Jewish upper 
strata, that Hitler's regime would soon collapse. 

During my first few years in Paris I lived, like most of my comrades, in 
miserable and marginal ways. My parents were not allowed to support me 
financially outside Nazi Germany, and the French government did not allow 
foreign exiles to take regular jobs in the tough job market of the depression 
years. I worked as a travelling salesperson for several wholesalers, and 
somewhat later I was privileged to work as a personal secretary to a Swiss 
author and journalist. I had a good number of other jobs, some of which have 
slipped from my memory. Only after the popular front government led by 
Léon Blum came to power in 1936 was I finally given a working permit which 
enabled me to work for the French branch of an American brokerage house 
until the outbreak of war. 

My early years in Paris I was able to study at the Sorbonne because tuition 
was free and attendance at lectures was not mandatory as long as one showed 
up from time to time and passed the requisite examinations. When I first 
decided to go to the Sorbonne as a student, I was by no means clear as to 
what I was going to study. After some flirting with modern history, I finally 
decided upon comparative literature mainly because I was in command of 
French and English in addition to my native German. I did well in my first 
few semesters, so well in fact that I was not surprised when one of my 
professors, Jean Marie Carré, asked me whether I had already made plans for 
a dissertation. I told him that I wasn't sure as yet but felt I might attempt a 
comparative study of the English novel in the Victorian age, the German novel 
of roughly the same time, and the French novel of the same period. My 
strategy would be to study the various ways in which the differing social 
structures of these countries influenced various patterns of growth of the novel. 
Professor Carré threw up his hands in horrified surprise and almost shouted: 
"Social structure, my friend, is not a subject of study in comparative literature, 
that is something to be studied in sociology." I took his advice. It is under 
these somewhat unusual circumstances that I became a sociologist. 

French sociology in the interwar years turned out to be a somewhat narrow 
enterprise. At the Sorbonne, the field was still dominated by the Durkheimian 
school, and it was taught by men such as Paul Fauconnet and Célestin Bouglé, 
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who had been Durkheim’s disciples and had not deviated from a straight 
Durkheimian path. I am still able to summarize the content of this or that 
chapter in, say, Suicide, when awakened in the midst of profound sleep. I 
don’t wish to complain about this Durkheimian emphasis. I learned a greal 
deal from my teachers and the various Durkheimian authors that we were 
required to read. But I also realized that there were a number of major 
sociologists from Germany and elsewhere that were simply ignored. There 
was a rumor that a brilliant young sociologist, Raymond Aron, had gone to 
Germany for an extended study period to work on Max Weber. We knew that 
Bouglé had once been a student of Simmel in Berlin. But Bouglé chose not 
to lecture on Simmel, and Aron was not yet a member of the faculty at the 
Sorbonne. I could have listened to Marcel Mauss at the Collége de France 
across the street from the Sorbonne, but I was not particularly interested in 
the highly technical subjects in anthropology on which Mauss lectured in 
those years. Halbwachs joined the teaching staff at the Sorbonne only after I 
had to restrict the number of hours given to sociological work when I was 
employed by the American brokerage firm. 

The only major sociologist discussed outside the Durkheimian magic circle 
was Karl Marx. Not that our sociology teachers paid much attention to him, 
but there were active Marxist discussion groups, some led by Communist 
Party members, or by various other lecturers representing left-wing sects and 
communities. I had already been familiar with Marxian currents of thought 
while still in Germany, and I now studied Marxist writings with considerable 
assiduity. The Marxist allegiance also allowed me to distance myself from 
most of my professors whom I could always ignore since, after all, I 
considered that what they taught was "bourgeois sociology." And yet, in that 
lively intellectual socialist milieu, questions were being raised in discussions 
with my French and German friends. Special mention must be made of Henry 
Jacoby, a refugee from Hitler's prisons, whom I came to consider my closet 
intellectual companion in Europe and later in this country. He opened vistas 
for me that I would not have had without our long discussions over many 
years. He freed me from a Marxist orthodoxy which threatened for a while 
to rigidify my thoughts. All in all, when I arrived in the United States as a 
refugee in 1941, I still was an unorthodox Marxist with strong admixtures of 
Durkheimian thought and a somewhat vague acquaintance with German and 
British social scientists whom I had read in the 1930s. 


Coming to America 


When I came to the States just a few months before the outbreak of the 
American phase of World War II, I worked for various government agencies, 
from the Office of War Information to the Deparment of Defense. Toward 
the end of the War, I needed to focus on a personal life plan for the future. 
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I had once hoped to make a living as a journalist, a kind of leftist Walter 
Lippmann, but I soon realized that there was no chance of being successful 
in that line for a recent arrival on these shores. Together with my friend 
Travers Clement, I edited a left-wing magazine, Modern Review, for a while, 
but I soon left because of political disagreements with the financial sponsors 
of the magazine. I also began to write book reviews for periodicals such as 
The Nation, The Progressive, and Partisan Review, but it was rapidly apparent 
that this was not a way to make a living. 

Rose Laub, whom I married soon after I met her, was the person who had 
worked on my "case" at the International Relief Association when I was a 
candidate for a special visa as a political antifacist refugee. Let me anticipate 
here and say that she has been my partner by now for over 50 years. She has 
had so intimate a part in my intellectual and emotional life that I find it almost 
impossible to sort out ideas of hers that later cropped up as mine. Ours 
continued to be an ideal collaboration of mind and a common sensibility that 
nothing could shake throughout the common joys and trials of half a century. 
After we had our first child, Rose decided to continue her studies, which had 
begun at the Ecole Libre des Hautes Etudes—a branch of the New School for 
Social Research. She sought advice from other young socialists in New York 
City, people like Seymour Martin Lipset and Daniel Bell, and they suggested 
that she continue her studies at Columbia. Through her I met the young Robert 
K. Merton, whom she much admired, and Robert Lynd, who was not 
intellectually stimulating but provided much needed support. Given such 
encouragement she applied and was accepted at the university. Because I was, 
as almost always, disposed to respect ber judgment, I decided in 1948 also 
to enroll as a graduate student at Columbia. 

In the spring of that year I received a call from Nathan Glazer, another 
member of the radical student coteries in New York, asking me whether I 
knew David Riesman. I had never heard of him but was told that he had been 
a brilliant young lawyer, the last Jaw clerk of Justice Brandeis, and until 
recently he had taught at the university law school at Buffalo. Riesman had 
decided a short time ago to shift from law to a career in sociology, I was 
told, and so he joined the Social Science staff at the College of the University 
of Chicago. He was in New York to recruit additional staff. I expressed great 
interest and was delighted when Riesman, after an hour or two of intense 
conversation, asked me whether I was available to teach at the College of the 
University of Chicago. When J asked what I would be expected to teach I 
was told: American History. I could hardly believe my ears. Why on earth 
would a university in the “Wild West" hire someone from Berlin, and Paris, 
to teach American history to what I thought were "corn fed" youngsters reared 
in Midwestern small towns. (I didn't know at the time that a high proportion 
of the University of Chicago undergraduates came from the East and 
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specifically from New York). I thanked David Riesman but told him that I 
would not teach American history. I was most astonished when the Dean of 
the College called me a week or two later to talk about an appointment at the 
College. I told him that I had already declined the offer to teach American 
History. “OK,” he said, “we have simply shifted a young sociologist into the 
history section so that you can now teach in the social science/sociology field.” 
Thus, just as I had shifted to sociology in France more or less by accident, 
so it was again by accident that I came to get my first American teaching 
position in sociology. 


Beginnings in Sociology 


My years at Chicago were exciting. While I had suffered in France from the 
narrow approach to social sciences at the Sorbonne, I was now exposed to a 
wide variety of intellectual stimuli at the Chicago College. My colleagues 
varied in their interests and careers. The majority were sociologists, but there 
were also young historians, and even a young poet. But it wasn’t only the 
backgrounds of my new colleagues that were exceedingly varied. The 
materials that we were supposed teach were also quite eclectic. We lectured 
on Weber’s Protestant Ethic, and Durkheim’s Division of Labor but, we also 
discussed Margaret Mead. We presented historical approaches but also 
structural anthropology, Myrdal’s American Dilemma, and Freud’s Civiliza- 
tion and Its Discontents, There was a surfeit of honey. What was most exciting 
were weekly meetings in which the whole staff participated and presented 
their varied views on the materials to be taught the next week. I am not sure 
how much my first-year students profited from my lectures and discussion 
groups, but I am sure that I learned an immense amount during the two years 
I taught at Chicago. 

Nevertheless I left the University of Chicago after two years of teaching 
there to return to New York, to devote at least one full year to an intense 
study of various branches of Columbia’s graduate sociclogy offerings. The 
department at the time was largely, though not wholly, dominated by young 
teachers who saw themselves to be on the forefront of the development of 
sociology and who claimed allegiance to a brand new theoretical approach: 
structural functionalism. Robert K. Merton, Bernard Barber, and Kingsley 
Davis had been students at Harvard under Talcott Parsons. Other members 
of the faculty, such as Paul Lazarsfeld and Robert Lynd, while not claiming 
allegiance to the newly developing branch of theory, were at least willing, 
so it seemed, to let it have a trial run in the department. With graduate students 
such as Suzanne Keller and Hanan Selvin, we followed in the steps of people 
like Philip Selznik, Al Gouldner, Marty Lipset, Peter Blau, Zena Blau, Pete 
Rossi, Alice Rossi, and Rose Laub Coser. Not all were orthodox functional- 


6 | COSER 


ists, but all in different degrees were deeply marked by the teaching of Robert 
K. Merton and his colleagues. 

What struck me forcefully at Columbia was the sense of working on the 
frontiers of sociology, building the foundations of what was sure to become 
the most exciting subject matter in the social sciences for a long time to come. 
After the staleness of the teaching at the Sorbonne, after what sometimes 
seemed the excessive eclecticism of the Chicago College offerings, Columbia 
seemed to me to have found a most productive approach. 

And so this erstwhile Marxian cum Durkheimian sociologist began to ally 
himself with the structural-functional school. Even though I was somewhat 
older than most of my fellow students, I felt like a youngster who is suddenly 
offered a cornucopia of riches, whose existence had not even been known to 
him before. These were the years in which most of the Columbia staff reached 
or approached the height of their productivity; tbat saw the publication of 
some of the brilliant papers by Robert K. Merton, when the first structural 
functional text books such as Kingsley Davis's Human Society, were 
published, and when Talcott Parsons was at the apogee of his powers. Some 
other Columbia teachers, who were not rushing toward functionalism, 
nevertheless were willing to give to this approach a tolerant and benevolent 
hearing. It was a joy to be alive in the Columbia atmosphere in the late 1940s 
and the early 1950s. 

I was willing to pledge allegiance to the school of functional-structural 
school. And yet, there remained some major reservations that prevented me 
from claiming full adherence to traditional functionalism. Columbia was a 
special place where the traditional functionalism of Malinowski and Parsons 
was, under the leadership of Robert Merton, subjected to critical appraisal. 
Soon I found myself again a heretic within the edifice of a mother church. 
Above all in the work of Talcott Parsons but to a degree also in the work of 
many of his students, one could observe a strong bias in favor of social 
harmony, common norms, stable equilibria, and commonly accepted frame- 
works of social action. This seemed rather unpersuasive an approach to 
someone who had seen the rise of facism in Europe, the ravages of the second 
World War, the Stalinist terror and murderous violence in the USSR. The 
world of Hiroshima and Auschwitz seemed hardly the same world as that of 
such benevolent liberals as Parsons and some of his first and second generation 
of disciples. Where in their writings were the horrors and inhumanity of the 
world that we had just lived through? So it came to pass that the dissertation 
I wrote under the guidance of Merton, and the subsequent publication of my 
first book, The Functions of Social Conflict, constituted my attempt to make 
social conflict the subject of a functional mode of analysis. I attempted to 
join the exploration of tbe largely tabooed subject—social conflict—to the 
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current stress on functional analysis. I tried, in other words, to join concern 
with the causes and consequences of social conflict to an exploration of its 
functions within specific historical or theoretical structures. 

The dissertation was written under the supervision of Robert K. Merton, 
traces of whose thought can be found on almost every page. But it was also 
deeply influenced by the sociological approach of Georg Simmel. I had read 
a few of Simmel’s writings in Paris, but at the time I had not been much 
impressed. The Marxian heritage had made me tone-deaf to Simmel’s formal 
approach to sociology. But while preparing for the dissertation at Columbia, 
I reread Simmel and found him one of the most fascinating sociologists I had 
ever encountered. I was so impressed, in fact, that I proposed to Merton to 
make the work of Simmel the subject of my dissertation, but Merton was not 
encouraging. He felt that beginners in the field should devote themselves to 
a less ambitious subject and work on a more delimited problem. I was 
discouraged when I went home after my lengthy talk with Merton and was 
prepared to move to some completely different subject when I looked again 
into Simmel’s Soziologie, most which had not as yet been translated into 
English. I reread his chapter on conflict, "Der Streit"—and suddenly resolved 
that I would devote the major part of the dissertation to a discussion of the 
conflictual rather than the harmonious aspects of social phenomena. I cannot 
say that I was aware at the beginning how I would structure the dissertation, 
but I still vividly remember sitting in my study and suddenly having an 
intuition of what later became the dissertation and the book. Together, of 
course, with Merton's preponderant influence, the dissertation was written 
under the aegis of Georg Simmel. 

This is not the place to deal in any depth with this study. Suffice it to say 
that the dissertation, and the book published subsequently, was my first major 
attempt to bring about a strange marriage between functional analysis, then 
the vanguard of American sociology, and Simmel's approach. 


Intellectual Influences 


The work of Talcott Parsons assumed an important place in my intellectual 
development. I had enormous respect for him and no sympathy for many of his 
critics who attempted to picture him as a reactionary servant of the American 
power elite. As a social thinker he was, of course, quite consistent with New 
Deal types of social thought. I felt indeed that Parsons and I. given our divergent 
backgrounds and personal histories, often had different opinions, but I had high 
respect for his work, above all his magnum opus, The Structure of Social 
Action. I told Parsons later that I was indeed often in another ideological 
political camp than he, but that I was in effect what the British call His 
Majesty's Loyal Opposition. An opposition, in other words, that refuses to 
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accept many of the premises of a given set of orientations but always stresses 
that there is a great deal of common ground between contenders in the political 
and social arena. Parsons smiled when I told him that, and I venture to think 
that he found such a train of thought much to the point ... I once spent a whole 
summer reading and rereading The Structure of Social Action—and I have never 
regretted it. Even though my first academic publication in America was a 
critical review of Parsons's Essays, I have never changed my opinion that 
Parsons and Merton are the two towering figures in twentieth-century American 
sociology. Only Erving Goffman might be considered also to have as exalted 
arank among modern American sociologists. 

However, I felt a gulf between the Parsonian approach to the social sciences 
and my own emerging sociological stance. Some though probably not all of 
this can be explained, I believe, by the different milieux in which we had our 
roots. It seems fairly obvious that a person brought up in the quiet milieu of 
a college president in a small Midwestern town in the twenties would tend to 
have a different ideological and political stance in regard to his country and 
its social structure from that of somebody who had grown up in the turmoil 
of the Weimar republic, among the stresses of a culture that was breaking 
down under one's eyes. Such persons could hardly be expected to share a 
similar Weltanschauung. In subsequent years, when I taught at Brandeis and 
lived in a Boston suburb, I came to know Parsons fairly well, though not 
intimately. My respect for him and his work never faltered. 

I worked directly under Merton's influence and had a much closer 
intellectual companionship with bim than I ever had with Parsons. It might 
be said that this was because I studied at Columbia rather than at Harvard, 
but I don't think that this is the key "reason." I felt closer to a teacher who 
had grown up under circumstances and trials and tribulations that were more 
nearly similar, at least to a certain degree, to my own. But above all, I was 
seduced—if this is the right word—by Merton's intellectual style. I held with 
him that the task of the new sociology that was growing in all the major 
intellectual centers in America should be limited to developing theories of the 
middle range, rather than to pursuing the chimaera of a huge structure that 
would likely be useless to the generations of young sociologists following the 
first generation of structural-functional sociologists. 

I have no space to discuss here the various teachers at Columbia who 
influenced me at that time. Let me only mention that I was ideologically close 
to C. Wright Mills, and we had for a while a fairly close companionship, but 
I was never much influenced by him in sociological matters. The only teacher 
who influenced me in something like the same degree as Merton was Kingsley 
Davis, another young structural-functional analyst. I was never as close to 
him intellectually as to Merton, but my debt to him was considerable. 
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My Columbia degree was conferred in 1954, and the book based on my 
dissertation, The Functions of Social Conflict, appeared two years later. The 
fate of the book might be of interest. When first published, it was by and 
large well received. I recall only one wholly negative review, and this was 
by a person who specialized in nasty reviews. The book, though received 
with respect, did not create much of a stir in the 1950s. As I recall, some 
3000 copies were sold at the time. But in the 1960s things changed 
dramatically. The civil rights movement, the student movement, and the 
multifaceted conflicts that marked those years quite naturally led the young 
students and activists to ask what sociological work might be helpful to 
understand a social scene that seemed hardly understandable within the fixed 
categories of classical structural functionalism or any other sociological 
system. To be sure, Max Weber or Karl Marx was able to provide some 
guidelines, but could contemporary sociologists contribute to their enlighten- 
ment? 

The work of C. Wright Mills was widely read in those years, but Mills had 
died in 1962. There were, or so it seemed to many young budding sociologists, 
only three recent books that not only analyzed social conflict but rejected the 
otherwise pervasive stance that saw conflict as a social disease, a dysfunctional 
and destructive element. These were Ralf Dahrendorf’s Class and Class 
Conflict in Industrial Socity, Max Gluckman’s Custom and Conflict in Africa, 
and my own book. I don’t know the sales of the other two books, but my 
book suddenly became a bestseller and a "must" book in departments of 
sociology and in neighboring departments both here and abroad. My book 
was translated into seven or eight languages, was a mainstay for graduate 
student instruction, and became one of those books that a young sociologist 
simply had to read. I tell this story mainly in order to point to a more general 
lesson. A book’s success or lack thereof is only weakly related to its content; 
it is largely determined by whether it finds an audience and times that uphold 
it. 

I published another book on the sociology of conflict, Continuities in the 
Study of Social Conflict, a decade later. This book, though treated with 
respect, never approached the sales of The Functions of Social Conflict, partly 
because it advanced ideas mainly supplementary to those in the first book, 
but also partly because the social scene had reverted to what was then referred 
to as “normalcy.” In any case, I was by that time getting bored with the study 
of conflict. I felt that I had “had my day” and I didn't wish to be known as 
“Conflict Coser.” I then turned to writing several books that explored 
sociological approaches to the world of ideas. Men of Ideas discussed the 
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relation of intellectuals of various types in institutional settings. Greedy 
Institutions, a study of institutions that “devoured” their members, was well 
received but didn’t have an impact comparable to that of The Functions of 
Social Conflict partly at least because it had no direct link to what was then 
happening on the social and political scene. 


Politics and McCarthyism 


Rather than itemizing still other books and papers I wrote or coauthored, | 
want to change gears and talk about another aspect of my writings, which, 
at least on the surface, has little connection with my sociological work. 

As I explained earlier, I became a democratic socialist still in my adolescent 
years and I have never waived from this allegiance. I shed a great deal of my 
Marxian heritage, and I have perhaps lost much of the optimism of my earlier 
years, as for example the belief that the kingdom of socialism was just around 
the corner. But I have never abandoned my conviction that a socialist 
commonwealth, a social utopia, must animate our concern with the future of 
humanity, if we don’t wish to see the world slowly regimented and 
bureaucratized in an iron cage. I still believe, even though this is hardly 
popular at the present, that a better and more just egalitarian democratic society 
should be high on the agenda of contemporary political thought, recent 
evidence to the contrary notwithstanding. 

This being the case, I asked myself, soon after coming to America, in 
which ways a socialist could spread his ideas even if only through relatively 
narrow channels. I wrote essays and reviews for such publications as The New 
Republic, The Nation, Partisan Review, Politics, and Commentary—which 
was not then a central organ of the neoconservatives. In those early years 
Dwight Macdonald, a superb radical writer and editor who published Politics 
during the war years and after, taught me whatever writing skills I possess. 
Those writings seemed to differ from my sociological writings, or so critics 
averred. And I will readily admit that there was indeed a different style and 
terminology in writing for the two different audiences. But I would also 
contend that in all my writings there are common themes that do not differ 
from my basic stance about the social scene as much as might at first seem 
the case to the more casual reader. J played two types of social roles—that 
of “disinterested” and “value-neutral” sociologist, and that of the social 
commentator on the radical left—for a good number of years without 
experiencing any difficulty in combining these differing commitments. But 
things changed rapidly in the late 1940s and 1950s with the coming of 
McCarthyism to the public scene. 

In those dire days, the McCarthy influence held sway over a good part of 
American life, be it in the academy or in the forum of general public debate. 
In the academy, in particular, the McCarthy wave of red-baiting led not only 
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to some academics losing their positions, but, more frightening, it led to many 
academics falling silent and adopting an Aesopian language in order to escape 
the McCarthy attacks. To put it differently, frightening as it surely was, the 
impact of censorship by demagogues and political authorities was finally not 
as detrimental to the free play of the mind as was the self-censorship of so 
many timid souls. Given this state of affairs many of my friends and I found 
it more and more difficult to write on so-called controversial matters for liberal 
or left-of-center publications. 


Dissent REFERENCE 


Sitting one afternoon in the faculty lounge of Brandeis University where both 
of us taught, my friend Irving Howe and I expressed our unhappiness with 
this state of affairs. One of us threw out the idea that if the existing publications 
were no longer inclined to publish our writings, we should perhaps create our 
own magazine. After a while, I suggested Dissent as the title of our new 
venture. 

We resolved tentatively to call a small gathering of political friends to 
explore whether they were willing to support such a magazine, be it with 
money or ideas for papers and reviews. To our astonishment, some 50 people 
gathered in New York and urged us to go ahead. The first issue of the magazine 
appeared in 1954. Howe and I were sure at the time that the magazine would 
have but a short life—we thought in terms of one to three years. Yet, to our 
amazement, Dissent turned out to have a long life; indeed, it continues still 
to be published after almost 40 years. The magazine profited from the 
contributions of a good many sociologists, but it was not a sociological 
publication. 

I felt at the time that while it was desirable to separate social scientific 
writings and advocacy, there was no reason that a particular person could not 
play both roles and play them well. I, for one, have tried to do so. There 
have, of course, been occasions where the sociological soul in one's breast 
clashed with the socialist soul, but these difficulties have been surprisingly 
few. 

By and large Dissent became a respected voice on the left, and I dare say 
that I often run into social scientists of various orientations who assure me 
that the mixture of left radicalism and social science has been over the years 
a quite sustainable combination. All in all, Dissent has been a continued 
source of joy and satisfaction for me. My friendship with Irving Howe and 
our close collaboration on Dissent for many decades have shaped a good deal 
of my ways of thinking and feeling about America and beyond. I look with 
equal enjoyment at the yellowed pages of Dissent and, say, the American 
Journal of Sociology, both of which contain contributions from my pen. 

Dissent prints some 10,000 copies each quarter. This is, of course, a small 
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circulation compared with the mass publications, but it is doubtless read by 
many more people in libraries and elsewhere. I haven’t written as much for 
Dissent in recent years as I have in the past. This was partly because I haven't 
been in New York City, where Irving Howe, Michael Walzer, and most 
younger editors are located, but it is also because I felt that my work for 
sociological audiences might at times have a wider impact than my political 
papers. But my somewhat reduced output in Dissent by no means indicates 
a falling off of interest. 

An Overview - 

Over the last 40 or so years, I have been active in the American sociological 
community. I have served as a President of The Eastern Sociological Society 
and of the American Sociological Association and also served for some ten 
years on the Executive Commitee and later the Council of the American 
Sociological Association. I was active in the Society for the Study of Social 
Problems and served as its President. This cumulation of honors is probably 
due to the fact that I never completely pledged allegiance to any one school 
of thought and therefore lay outside the fray of contenders. 

The reasons for the fairly rapid decline of the functionalist ascendancy in 
the sixties and after have not been fully elucidated. But one of the reasons, 
fairly widely admitted by young neofunctionalists such as Jeffrey Alexander, 
has been the failure of Parsons and his disciples to come to terms on a 
theoretical level with the central importance of social conflict in human affairs. 
This being the case, I was spared some of the “poisoned arrows” directed at 
the functionalists for their neglect of social conflict. Even though I occasion- 
ally came under fire for having authored a book on the functions of social 
conflict, I was generally given some measure of grace because I had written 
a book on the functions of social conflict. 

As readers of these pages know, the last 20 or so years have been years of 
turmoil, distress, and dissension in the ranks of American sociology. The 
dominance of structural functionalism has come to an end. But no other single 
theoretical tendency has taken its place, so that one is tempted to compare 
the current state of affairs in the American sociological field to a “Tower of 
Babel.” One finds now a great number of different tendencies in sociology, 
and many spokespersons of these tendencies are now unable to understand, 
let alone speak, the language of their theoretical opponents. Given the 
incomprehension by some theorists when it comes to evaluating some of their 
theoretical antagonists, much of what passes nowadays as theoretical discourse 
is expressed, in my opinion, in the esoteric language that only the members 
of this or that theoretical sect understand. It is, however, futile to complain 
about sectarian tendencies if one’s own intellectual products may also have 
a sectarian character in the eyes of some antagonists. 
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What is needed today, it seems to me, is not only tolerance for the 
eclecticism that dominates the sociological battlefields but also, to use a phrase 
of Robert Merton, disciplined eclecticism. We are now clearly experiencing 
a crisis in American sociology. In fact we seem to be so insecure when it 
comes to envisioning sociology’s future that we are tempted for the first time 
since the days of Herbert Spencer or August Comte to hanker after foreign 
products said to be superior to the homegrown variety. Although, given my 
background, I am happy that the Age of Nativist Theorizing came to an end 
a fairly long time ago, and although we have learned a great deal from Juergen 
Habermas, Anthony Giddens, Pierre Bourdieu, and several others, we need 
not fall prey to a kind of inverse snobbism which ranks foreign products as 
necessarily superior to homegrown offerings. 

Some years ago, I wrote a book entitled Refugee Scholars in America in 
which I tried to assess the contributions of refugee scholars to the work of 
their various American scholarly disciples. I concluded that such contributions 
varied considerablely among scholarly and intellectual disciplines, but they 
surely had an immense influence on American cultural life, in general. It 
might not be too far fetched to compare the present enhanced influence of 
foreign authors to the impact of refugees in the 1930s and 1940s. In both 
cases, European thinkers have exercised considerable influence in the various 
branches of the social sciences, but they became enduring contributors only 
if they became part of the multistranded fabric of indigenous work. 

Some special comments seem to be needed about the current ascendancy 
of Marxist thought in America. When one thinks of the not too distant past 
in which Marxism was tabooed almost everywhere on the American socio- 
logical landscape, one cannot but be delighted to see how Marx's thought has 
not only infiltrated but has been absorbed in American social thought. We 
have gained immensely from finally abandoning the nefarious “Know-Noth- 
ing" attitude with which Marxism was persecuted in the recent past. We have 
witnessed over the last 20 or so years a flourishing Marxist literature that is 
truly astonishing. Yet there is something curious in the recent Marxist 
renaissance in America. While Marx stressed in all his teachings the close 
link between thought and action, the current American scholarly Marxism is, 
well, very academic indeed. The writings of many younger Marxist scholars 
seem aimed to integrate them in the academy rather than to result in social 
action. In some cases; these Marxists in quest of tenure are, it seems to me, 
rather inauthentic persons. The key Marxian idea of false consciousness has 
some application in a cultural milieu, the modern American academy, of 
which Karl Marx surely never dreamed. 

It seems that most, though not all, American sociological research now 
contributed by young Marxists is to be found in the long neglected area of 
historical sociology. In addition, a number of historical approaches in 
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contemporary sociology have taken Max Weber as their guiding light. This 
is a positive development. It seems that the mutual suspicions and misunder- 
standing that for long hindered a profitable commerce between the disciples 
of history and sociology are now breaking down. Historians now read, discuss, 
and absorb in their own works the labors of such sociological scholars as 
Emmanuel Wallerstein, Theda Skocpol, and Barrington Moore. 


The Status of Theory in Sociology 


This leads me to add a number of ideas to the perennial debate on the status 
of theoretical thought in our discipline. All too often in the past, theory and 
empirical research have had very little connection in the sociological realm. 
Empirical researchers would politely bow to theoretical contributions in the 
first paragraphs of their research reports but would then happily proceed on 
their empirical pathways without ever coming back to the theoretical propo- 
sitions. On the other hand, many theorists have tended to regard theoretical 
propositions as of value in themselves, so that they have felt no pressing need 
to validate theoretical ideas in the interplay of theory and research. I believe 
both of these stances are pernicious and detrimental to the further growth of 
our discipline. May I outline, by way of a homily, what I would consider a 
healthy development in sociology through a constant relationship between 
theory and research. 

When there is some trouble with your kitchen sink or with your toilet that 
you cannot handle yourself, you are likely to call a plumber. When he arrives 
he displays a tool kit in which he has packed a variety of tools that he expects 
to be helpful in his attempts to fix your malfunctioning appliance. Why does 
he bring a large kit containing many tools when all that seems to be needed 
are a tool or two? When asked, the plumber is likely to answer that he does 
not know beforehand which tool will be of use in fixing your appliance. He 
carries a kit with many tools because he is initially ignorant of the character 
of your problem. Once he has surveyed the situation he is able to choose the 
appropriate set of tools. 

I believe that sociologists should handle the problems of the relation 
between theory and research in roughly the same manner as the plumber. He 
may upon occasion be able to develop a theoretical tool without being 
concerned for the moment with its application, but in the long run, a tool will 
be of value only if it is applied in research operations. It is the dialectical 
interplay between tool and research problems that is, or rather should be, the 
essence of the sociologist's tasks. 

To take just one more example, the assessment of the sociological 
contributions of Emile Durkheim. One ought to recognize that had he 
published only The Rules of Sociological Method, he would not occupy a 
major position among the select group of our ancestors. It is largely because 
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he showed us the uses of his theoretical and methodological work in such 
contributions as Suicide or The Elementary Forms of Religious Life that we 
continue to honor him. In these works, he used his theoretical tools to elucidate 
or analyze specific problems and so he became one of our most honored 
founders. 

Even though Merton’s papers on the relation between theory and research 
were published a good while ago, they seem as pertinent today as they were 
in the past. Merton argued, at least in part, against Parsonian positions, that 
our major concern must be with developing theories adequate to limited ranges 
of phenomena and that this task will be hampered if attention is centered on 
grand theory in the large. What he had in mind was, of course, that overall 
general theories were so far removed from a given field of research that there 
was no way of applying them to the relatively delimited research tasks of the 
moment. He argued that the relation of theory and empirical research should 
not be regarded as a one-way street. It does not suffice to apply a general 
theory to a particular research; it has to be realized that initial applications of 
theory to a research task are likely to react back to reformulate more adequately 
the particular problem at hand. In other words, the relation between theory 
and subject matter to be explained is never a one-way street but always 
involves a dialectical interplay between the subject to be investigated and one 
or more theoretical approaches. 


Conclusion 


Compared to the good years of the 1940s. 1950s, and 1960s, American 
sociology is currently undergoing a series of lean years, but one need not be 
a devotee of a dialectical thought to predict that sooner or later the “fat” years 
will return. We have no way of knowing precisely when this will be. But I 
am sure that there will again be a time in the life of the sociological mind 
when the high hopes of several decades ago will return. One can hardly predict 
the future of sociology with precision, but one may nevertheless express the 
conviction that social science without sociology would be a much diminished 
field of human endeavors. 
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Abstract 


This review assesses sociological and historical research relevant to the 
emergence and consolidation of the American middle class in the nineteenth 
century. On the one hand, macrosociological theories have relied on a 
two-class model which renders the middle class a residual social category. 
Yet, on the other hand, community studies of the "new" social history—while 
they have opened up new avenues of inquiry into the complex social processes 
underlying middle class formation —have tended to focus on particular decades 
of the nineteenth century, leading to a fragmented view of the occupational 
composition of the middle class. Distinct literatures have developed around 
the study of particular occupational strata: artisans, small capitalists, white- 
collar wage earners, and the petite bourgeoisie. We argue here that different 
occupational groups overlap in time and represent a heterogeneous and 
historically shifting middle class rather than distinct entities. The argument 
for the integrity of a distinct middle class also rests on an understanding of 
the development of urban institutions and the cultural expressions of middle- 
class lifestyles and behavior. The expansion of this middle class, however, 
was closely linked to a growing economy and increasing equality of 
opportunity. We speculate that the reversal of these conditions, evident from 
the 1970s, may undermine the well-being of the middle class and its correlative 
social values, notably tolerance and civility. 
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INTRODUCTION 


Increasing public concern about the effects of prevailing economic conditions 
on the middle class in the United States has spurred sociological interest in 
its historical origins and development. While current debates focus on the 
instability and possible decline of the middle class, the structural and cultural 
foundations of middle-class formation in the nineteenth century are largely 
obscure. As yet, there is no coherent middle-class history. Investigations 
focused specifically on the historical development of the middle class are 
sparse, and there have been few attempts to synthesize the diverse studies of 
the “new” social history in terms of their relevance for understanding the 
middle class. 

This review assesses social historical research relevant to the emergence 
and consolidation of the middle class in the nineteenth century. The 
development of the middle class is rooted in the transformation of class and 
occupational structure that accompanied the growth of industrial-capitalism, 
rapid urbanization, immigration, and geographic mobility. Key issues ad- 
dressed in this review are: How has the middle class been defined in 
sociological and historical research? What are its central features? What were 
the social conditions of its emergence? What are the relative importance of 
structural and economic conditions in shaping middle-class identity? What 
role did institutions play in shaping cultural values for aspirants to the middle 
class? 

This article is organized as follows. First, we examine two macrosociologi- 
cal theories of industrialization and social change—post-industrial and 
Marxist theories—in terms of their treatment of the middle class in the 
nineteenth century. After a consideration of the reasons why such macro-level 
theories have deemphasized the emerging middle class, we focus on the 
studies in the "new" social and cultural history. This research has greatly 
expanded the understanding of both the structural and cultural foundations 
of middle-class identity in the nineteenth century. Two aspects of work 
were significantly altered over the course of the nineteenth century and 
formed the structural basis of the middle class: the ascendance of nonmanual 
work, and the expansion of entrepreneurship primarily through small 
retailing. Other structural underpinnings of middle class life include the 
relative geographic and occupational stability of the middle class, combined 
with high rates of mobility into the middle class. We then examine the 
social institutions and diffusion of cultural values that provided coherence 
to the middle class. Finally, we speculate on how structural economic 
changes in the twentieth century have jeopardized the cultural and social 
foundations of the middle class. 
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SOCIOLOGICAL THEORIES AND HISTORICAL DATA 


The middle class is the least studied segment of nineteenth-century American 
society, a result of sociologists’ and social historians’ focus on urban elites 
and labor, the two classes that comprise the most visible elements in 
nineteenth-century social structure. Both macrosociological theories of indus- 
trialization and social change and historical community studies have incorpo- 
rated a focus on these two classes, reflecting sociologists’ and historians’ 
dominant interest in the relationship between workers and owners. What we 
term here simply as the “two-class model” has been compelling because of 
its analytic strength in explaining the trajectory of social change under the 
emergence of capitalism. On the one hand, the persistence of rich elites in 
major US cities is especially well documented (Andrews 1941, Baltzell 1966, 
Kouwenhoven 1953). On the other hand, labor and social historians have 
illuminated the economic inequalities and exploitation of workers that 
increasingly characterized capitalist society throughout the nineteenth century 
and affected a large and rapidly expanding working class (Dawley 1976, 
Gutman 1966, Hareven 1982, Oestreicher 1986, Stott 1990, Wilentz 1984). 
The pervasiveness of the two-class model, however, has in turn led some 
social historians to call for more focused theoretical and empirical study of 
the middle class and its relationship to industrialization (Berlanstein 1988, 
Blumin 1989, Bruchey 1980, Ryan 1981, Thernstrom 1973, Zunz 1985). 


Macrosociological Theories 


Two macrosociological theories of industrial-capitalism and class formation 
seek to describe and explain changes in class structure in the last two centuries. 
These are post-industrial and Marxist theories. Both center attention on the 
economic foundations of class and emphasize occupations, economic returns 
to work, and the top and bottom of the class structure rather than the middle. 
In these theories, the fate of various middle classes—self-employed artisans, 
small entrepreneurs, and white- collar employees—is related to phases of 
industrial-capitalist development and larger structural changes in the distribu- 
tion of occupations and in the nature of work. Both propose a long-term 
historical decline of master artisans and middle-class entrepreneurs in com- 
mercial and early industrial cities with industrialization. This decline is 
attributed to mechanization, the relative contraction of industrial sectors in 
which these groups were historically concentrated, a business environment 
increasingly dominated by large enterprise, and the invasion of small retail 
markets by large corporations (Wright & Singelmann 1982). 

Post-industrial theory has largely overlooked the emergence of middle-class 
occupational groups prior to the expansion of clerical work at the end of the 
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nineteenth century. Post-industrial theory only superficially treats the replace- 
ment of both the artisanal middle class—underway since the early part of the 
century—and the “old” commercial middle class by clerical, sales, and 
managerial workers, and the transition to a service economy (Bell 1973, 
Chandler 1977, 1990, Gordon, Edwards & Reich 1982, Jacoby 1985, Mills 
1951, Singelmann 1978). Post-industrial analysts view declines in small 
capitalism and artisanal employment as reflecting structural economic trends, 
such as growth of a wage-earning labor force in manufacturing, decreasing 
agricultural employment, and increasing migration from rural areas to cities 
and towns. The decline of small capitalists is assumed to be a result of the 
concentration and centralization of capital and growth in the number of 
white-collar wage earners (Mills 1951, Steinmetz & Wright 1989, Wright & 
Singelmann 1982; see also, Blumberg 1980). 

In sum, post-industrial theory has emphasized the proliferation of white- 
collar wage earners as a middle class which emerged at the end of the 
nineteenth century, without examining in depth the social changes which led 
to this historical formation. The focus of attention has been primarily on the 
character and meaning of work in the twentieth century and the development 
of a service economy. 

Marxist theories have viewed the middle class as primarily transitional and 
have emphasized proletarianization as the absorption of master artisans and 
small capitalists into the manual working class. In this perspective, proletar- 
ianization accompanies capital accumulation and the centralization of business 
enterprise (Braverman 1974, Marx [1852] 1963, Marx & Engels [1848] 1982, 
Walton 1987). In the Marxist view, the decline of small capitalism and 
artisanal employment occurs through proletarianization and the expansion of 
the working class. However, recent studies of the petite bourgeoisie—the 
lower middle-class stratum of very small-scale self-employed—have been 
critical of this view. These analyses derive from the Marxist perspective but 
also make the critical historical observation that the petite bourgeoisie was 
not entirely absorbed into wage employment with the transition to industrial 
capitalism (Bechhofer & Elliott 1985, Blackbourn 1985, Bland, Elliott & 
Bechhofer 1978, Clegg, Boreham, & Dow 1986, Crossick & Haupt 1984, 
Gerry & Birkbeck 1981, Mayer 1975, Poulantzas 1975, Scase 1982, Scase 
& Goffee 1982). 

Both post-industrial and Marxist theories provide only a partial basis for 
understanding the origins of the middle class in the nineteenth century. While 
Marxist theories point to the proletarianization of artisans and post-industrial 
theories to the emergence of managerial capitalism as central to understanding 
change in the middle class, neither perspective relates them to each other or 
to concurrent changes in small capitalism. Moreover, they emphasize the 
macro-level economic foundations of class without a consideration of its 
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ecological, institutional, and cultural bases—which had become increasingly 
salient by the end of the nineteenth century. The remainder of this article 
examines the work of social and cultural historians as it has opened up avenues 
of inquiry into the varied processes of social change which led to the 
emergence of a distinct —albeit shifting—middle class during this era. 


How Many Middle Classes? Defining the Middle Class 


Generally the focus on capitalists and labor that has guided historical research 
on industrialization, class structure, and social change explicitly excludes the 
middle class or renders it a residual social category (Ryan 1981). This is 
especially true in the work of Katz, Doucet & Stern (1982) although they 
bave limited this model to the mid-century period (see Vinovskis 1985, Zunz 
1985). A more pervasive position is that while middle-class occupational or 
socioeconomic strata may have been present throughout most of the century, 
they did not play a significant role in the critical social changes that were 
underway. 

In both macro-level and micro-level studies, researchers have frequently 
defined or described the middle class 1n a rather circumscribed way, as one 
or another occupationally based subgroup located at a specific historical 
juncture. Four distinct literatures can be discerned, each focusing on a different 
occupational stratum. The first focuses on artisans of early and mid-century, 
whose work was primarily skilled manual self-employment (Blumin 1989, 
Dawley 1976, Griffen & Griffen 1978, 1980, Hirsch 1978, 1980, Laurie & 
Schmitz 1981, Ross 1985, Scranton 1983, Stott 1990, Wilentz 1984; see also, 
Rock 1989, Wallock 1987). A second literature looks at the entrepreneurial 
middle class of retailers, the "old" middle class based in small capitalism, of 
the mid- to late-nineteenth century (Bell 1973, Bruchey 1980, Decker 1978, 
Mills 1951); and a third at the “new” nonmanual wage-earning middle class 
of clerical, technical, and managerial workers which emerged at the end of 
the nineteenth century (Bell 1973, Blumberg 1980, Braverman 1974, 
Dahrendorf 1959, Giddens 1979, Kaelbe 1981, Kocka 1980, Whyte 1956, 
Wright 1979). Finally, some research examines the petite bourgeoisie, or 
lower middle class of petty business owners, which persisted throughout the 
nineteenth century (Bluestone 1991, Bechhofer & Elliot 1985, Mayer 1975, 
Scase 1982, Scase & Goffee 1982). 

We argue here that these occupational strata do not compose distinct middle 
classes or entities, but rather reveal a middle class characterized by hetero- 
geneity and a historically shifting social composition. In fact, the composition 
of the middle class often appears to depend on the historical period under 
consideration. Studies of the early- to mid-nineteenth century describe the 
importance of artisans in this period in which manual and nonmanual 
differences in work and social status uS Le out. In the early 
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industrial period, attention centers on small capitalists. For the latter decades 
of the nineteenth century, with the emphasis on industrialization and large 
firms, it is white-collar employees who are defined as comprising the middle 
class. 

Only Blumin (1989), in his study of nineteenth-century Philadelphia & 
New York, has dealt with the question of the timing of middle-class formation 
at length, attempting to determine the point at which middle-class identity 
took shape in both structural and cultural terms. Blumin has concluded that 
the middle class emerged in the three decades prior to the Civil War, primarily 
through a convergence in the structural circumstances and social experience 
of nonmanual workers. According to Blumin (1989), the creation of the middle 
class involved a convergence in socioeconomic status, living patterns, the 
culture of work, and social perceptions and self-identity of nonmanual 
workers. The historical transformation of “middling sorts” into a middle class 
involved a decline of skilled craft workers and master artisans as a sector of 
the urban labor force, and a concomitant decrease in their social and economic 
status in relation to an expanding nonmanual sector. 


THE MIDDLE CLASS AS A CASE STUDY IN CLASS 
FORMATION 


Community-level studies have revealed the intricacies of the social processes 
which led to middle-class formation and identity. Community studies relevant 
to the middle class vary in the attention they give to the economic and cultural 
bases of class identity. While Lynd & Lynd (1936) elaborated the economic 
factors that define and perpetuate class boundaries, Warner (1960 [1949]) 
contended that the real differences were not so much economic as linked to 
the cultural significance of lifestyles. More typically, researchers have 
attempted to be sensitive to the interplay of economic and cultural features 
Of class, yet may not make a case for the primacy of one set of factors over 
another (Gans 1962). 

The framework provided by the new social history allows an exploration 
of how workers’ social perceptions and experiences relate to structures of 
opportunity and class within the context of the social world of the middle 
class and the evolution of middle-class values and institutions. Significant 
themes which have arisen in these studies include geographic and temporal 
variation in the emergence of the middle class; variations in local social and 
economic conditions which produced these; the transformation of nonmanual 
work and entrepreneurship as a basis of middle-class employment with 
industrialization; and, historical change in the occupational experience and 
social mobility patterns of middle-class workers. 

During the last two decades, then, important progress toward a middle-class 
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history has been made, culminating in Blumin's (1989) comprehensive 
investigation of middle-class formation. Other major works include those of 
Johnson (1978), Ryan (1981), Zunz (1982, 1990); and, on the English middle 
class, Davidoff & Hall (1987). In addition, a substantial literature has 
developed on middle-class families in nineteenth century, focused on values, 
domestic ideology, and gender roles (Abelson 1990, Cott 1977, Degler 1980, 
Ryan 1981, Marsh 1990, Mintz 1983, Mintz & Kellogg 1988, Rosenzweig 
1991). These studies support an argument that the middle class comprised not 
only a significant segment of the nineteenth-century urban population, but 
that it was influential in processes of social change as well. 


Strengths and Limitations of Historical Community Studies 


Community-level studies have opened up macro-level theories of social 
change to historical investigation and reevaluation. Since the advent of the 
"new" social history in the 1960s, the analysis of industrialization, class 
formation, and social change has proceeded primarily through community- 
level studies which have emphasized variation in the timing, nature, and 
effects of industrialization (Elder 1981; see also, Zunz 1985). While the 
theoretical premises of post-industrial and Marxist theories have discouraged 
a specific focus on a third class, the empirical and inductive work of social 
historians has opened the door to a more complex view of class composition 
in nineteenth-century society. 

To achieve their essentially interdisciplinary tasks, sociologists and histo- 
rans have borrowed heavily from each other's theoretical orientations, 
concepts, and methodologies (Skocpol 1984, 1987). The perspectives of both 
disciplines are reflected in the goals of social history, as stated by Tilly (1987, 
1985): the reconstitution of the lives of ordinary people in other historical 
eras, and their connection with larger social structures and processes of social 
change. 

In the last 30 years, the development of social history has been characterized 
by an expansion of inquiry into a variety of new areas; the use of a scientific 
approach and quantitative methodologies; and the widespread employment of 
the community study method as a research technique (Zunz 1985). Both social 
science theories and the community study approach have become increasingly 
common in social-historical research. In turn, sociologists have shown greater 
concern for the historical context of larger processes of social change and 
have more often reexamined their theoretical claims through the comparative 
study of specific historical cases (Skocpol 1984, 1987). Finally, nominal 
record-linkage of historical data sources—-birth, death, and marriage records; 
tax lists; and, city directories—has proven to be a powerful technique in the 
reconstruction of individual lives and community experience in past eras 
(Jarausch & Hardy 1991, Willigan & Lynch 1982, Wrigley 1973). 
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The strength of the community study method is that it allows an ecological 
or contextual approach to the analysis of social change. Its limitations have 
also been well-documented. A critical literature developed early, in large part 
in response to questions about the viability of historical mobility studies 
(Alcorn & Knights 1975, Blumin 1968, Chudacoff 1974, Ebner 1981, 
Engerman 1975, Frisch 1974, Hardy 1989, Hauser 1982, Henretta 1977, 
Hershberg & Dockhorn 1976, Katz 1972, Sharlin 1979, Sharpless & 
Shortridge 1975, Thernstrom 1975, Warner 1968b, Williams 1981). 

While a lack of systematic data for the longitudinal study of changes in 
occupational composition and class structure in the nineteenth century 
(Anderson 1988, Conk 1978, Hollander 1980) renders a community-study 
approach necessary; the result has been a literature which is extremely diverse 
with regard to historical period, geographic and community context, and 
substantive emphases. The focus is on delimited topics which can be addressed 
with quantitative techniques, such as mobility patterns; there are questions of 
the representativeness and generalizability of individual studies; and, perhaps 
most problematic, a general lack of cumulation and synthesis. This latter 
problem may be most critical: social historians who have completed intensive 
historical community studies often do not integrate their own findings into a 
broader framework of social and cultural change. While the findings of a 
single study are limited in generalizability, they ideally allow for cumulation 
and replication across communities. How often such cumulation and replica- 
tion occurs, however, is another matter. 

In sum, if macrosociological theories have been overly broad and are 
insufficiently grounded in empirical studies of social classes, then historical 
work has sometimes become bogged down in descriptions of social and 
occupational groups without relating them in more than superficial ways to 
larger theories of change. The fragmentation of social-historical work on the 
middle class, and social class generally, stems 1n part from the difficulties 
inherent in the community study approach. | 


Structural and Cultural Foundations of the Middle Class 


The perspective of social history offers several useful vantage points for a 
critical assessment of macro-level theories of industrialization and middle- 
class formation. Central among these is an emphasis on the interaction of 
social-structural and cultural elements in the study of nineteenth-century social 
organization; an elaboration of a theory of industrialization and the gradual 
transformation of a social organization based largely on ethnicity to one based 
increasingly on social class (e.g., Zunz 1982); and, a tradition of historical 
mobility studies, in which theoretical and methodological approaches to the 
assessment of class structure and stratification have been developed. 
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The most important of these is the first, namely, the recognition in social 
history of the interdependence of social-structural and cultural elements. 
Accordingly, the emergence of the middle class was rooted in the transfor- 
mation of work under industrial-capitalism and shifts in occupational compo- 
sition over the course of the nineteenth century. Middle-class identity was 
formed through related processes of institution building, the development of 
an increasingly homogeneous middle-class culture, and the wide diffusion of 
middle-class lifestyles and cultural codes in cities. 

The remainder of this article is organized as follows. First, we examine the 
structural foundations of middle-class work and identity. These include the 
geographic and occupational stability of middle-class strata relative to other 
groups; high rates of mobility into the middle class in the industrial period; 
the elevation of nonmanual work; and, finally, the expansion of entrepreneur- 
ship through small business ownership. Next, we examine the cultural and 
institutional foundations of an emerging middle class and the diffusion of 
values and lifestyles that provided an increasing sense of coherence. Finally, 
we briefly trace the subsequent trajectory of the twentieth-century middle 
class. 


Geographic Mobility, Residential Segregation, and 
Surburbanization 


Geographic transiency and high rates of mobility within and between 
communities is a central characteristic of nineteenth-century cities (Barrows 
1981, Decker 1978, Katz 1975, Katz, Doucet & Stern 1982, Hardy 1989, 
Monkkonen 1988, Thernstrom 1973, Themstrom & Knights 1971). For 
example, Thernstrom (1973) reported nonpersistence for decadal intervals to 
range from 40% to 60% in nineteenth-century communities regardless of 
community size; and Parkerson (1982) has estimated nonpersistence rates for 
several nineteenth-century cities to be in the range of 60% to 80%. Hardy’s 
(1989) ten-year nonpersistence rate for adult males in Indianapolis is 69%; 
after adjusting for estimated mortality and enumerator error, she estimates it 
to be between 35% and 40%. 

One of the interesting implications of this fluidity of population movement 
both within and between cities is considerable inter-occupational and inter- 
ethnic contact. In spite of the tendency of ethnic and religious groups to live 
in segregated enclaves, high rates of geographic mobility contributed to the 
visibility of middle-class lifestyles and helped to foster common frameworks 
of meaning. This is a point we return to later. 

There is also evidence that geographic and residential stability were greater 
among the middle class than the working class, and that the attainment of 
middle-class status promoted such stability (Binford 1985, Warner 1962). 
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Studies have consistently documented that the likelihood of persistence 
increases directly with socioeconomic status; nonpersistence is most charac- 
teristic of young, male, manual workers. For example, Knights (1971) has 
estimated that in Boston from 1840 to 1850, the rate of nonpersistence was 
64% for unskilled manual workers but only 31% for high status white-collar 
workers. Data on Detroit suggest that entrepreneurs had a higher geographic 
persistence rate than has been found for the working class and laborers 
generally (Archer 1993). 

In addition, although white-collar workers were more likely than manual 
workers to remain in a particular city and residence, they also sorted 
themselves into distinctly middle-class neighborhoods. This trend persisted 
throughout the years of rapid suburbanization (Binford 1985, Blumin 1989, 
Monkkonen 1988, Warner 1962, 1968a). High rates of geographic persistence 
coupled with social integration among middle-class groups through both 
residence and workplace would ultimately facilitate the formation of middle- 
class identity. 


Nonmanual Work & Entrepreneurship 


The manual/nonmanual basis of work emerged over the nineteenth century as 
the major occupational division among workers, establishing the most 
significant basis of class and occupational stratification (Blumin 1989, 
Bruchey 1980, 1990, Decker 1978, Griffen & Griffen 1978, 1980). At the 
turn of the nineteenth century, artisans as well as merchants constituted the 
"middling sorts"; as indicated, for example, by their greater average wealth 
and size of residence as compared to unskilled workers (Blumin 1989). But 
the structural ascendance of nonmanual over manual work in the first half of 
the nineteenth century would create the foundation for the development of 
middle-class identity in subsequent decades. By the end of the nineteenth 
century, the manual-nonmanual basís of work clearly differentiated workers 
with respect to their opportunity for stable membership in the middle class. 

One of most significant changes associated with nonmanual work that 
occurred around mid-century was that the source of income became salaries 
and profits rather than wages. This accompanied the growth, and tendency 
toward specialization, in retailing, and the physical separation of manual and 
nonmanual work environments (Blumin 1989). Throughout this period the 
manual-nonmanual distinction became increasingly significant in its im- 
plications for workers' lives, so that the boundary between the middle and 
working classes was formed above, rather than below, skilled manual labor. 
Blumin (1989) argues that by mid-century a middle class dominated by 
entrepreneurs and salaried nonmanual workers had emerged, distinct from 
both urban elites and labor in economic circumstances and social experience 
and perceptions. 
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ARTISANS AND INDUSTRIALIZATION If artisans were dominant in defining the 
“middling sorts” in the first part of the century, then their decline is central 
to understanding subsequent middle-class formation. A large literature exists 
on the proletarianization of artisans. The preponderance of historical evidence 
suggests that work opportunities in traditional craft occupations were dwin- 
dling throughout the nineteenth century. With industrialization, much of 
artisans’ work could be mechanized or parceled out to less-skilled workers. 
Even when they remained self-employed as master craftsmen, the content, 
skill level, and relative socioeconomic rewards of skilled work declined in 
many industries (Blumin 1989, Hirsch 1978, Katz 1975, Bruchey 1980, 
Griffen & Griffen 1978). Artisan-proprietors who maintained their status as 
master craftsmen in the early industrial period remained at a greater risk than 
retailers and nonmanual business owners of falling into the manual working 
class (Archer 1993). Master artisans who became skilled laborers experienced 
an employment status shift which most likely constituted downward mobility 
in income, social status, and work autonomy (Blumin 1989). 

Some historical studies have led to refinements of this view through a 
variety of interpretations of the swiftness and totality of the decline of skilled 
artisans over the course of the nineteenth century (Form 1987). Artisans played 
a prominent role in shaping the development of industrial-capitalism in many 
communities, for example, through creation of the technology for mass 
production (Bruchey 1980, Hirsch 1980). The occupational circumstances and 
mobility patterns of artisans varied greatly by industry and craft, and whether 
or not retailing functions were incorporated into businesses (Griffen & Griffen 
1978). Of course, considerable variation occurred from one setting to another, 
leading some to question the pervasiveness of deskilling, and raising the 
possibility that the factory system actually created as many new skills as it 
eroded (Dobson 1980, Form 1987, More 1980). Even if this were true, the 
elite artisan traditions declined within the overall context of factory work. 

In later decades, artisans who entered self-employment, especially retailing, 
could still attain middle-class status. Although the expansion of markets led 
to a general decline in the local demand for crafts, in immigrant neighborhoods 
this local demand persisted. These ethnic niches have been documented for 
a variety of immigrant groups (Bodnar 1985, Bodnar, Simon & Weber 1982, 
Bruchell 1980, Conzen 1976, Kessner 1977, Lopata 1976, Morawska 1985, 
Vinyard 1976, Zunz 1982). In mid-century Poughkeepsie, there were still 
substantial opportunities for upward mobility through the transition from 
skilled manual work to proprietorships, and immigrants were especially likely 
to have made such shifts. But as business skills were becoming more essential 
than crafts skills in determining artisans’ ability to attain middle-class status, 
the ability of artisans to be successful in self-employment declined as the 
century progressed (Griffen & Griffen 1978). This lends some support to the 
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deskilling hypothesis, but more generally the evidence on artisans suggests 
both upward and downward mobility for this occupational group. 


ENTREPENEURSHIP A less explored foundation of middle-class identity in the 
nineteenth century is small capitalism. The post-Civil War decades saw both 
the ascendance and consolidation of large-scale industrial-capitalism and the 
expansion of small capitalism in retailing and proprietorships (Bruchey 1980, 
Porter 1973, Porter & Livesay 1971, Vatter 1980). Although the importance 
of small capitalism has often been slighted, and self-employment did in fact 
decline relative to the growth of manufacturing, a fairly large segment of the 
labor force was self-employed throughout the century’s last decades. 

Estimates of the percentage of the workforce that was self-employed in the 
nineteenth century vary. Such estimates are problematic given the inadequacy 
of the manuscript census for the systematic identification of self-employment 
status. Small capitalism probably accounted for half of all middle-class 
workers in commercial and early industrial cities. Interestingly, the social 
homogeneity which currently characterizes entrepreneurs (Form 1985, Becker 
1984, Bregger 1982) appears to have existed in industrializing cities as well 
(Archer 1991a, Zunz 1982). 

Griffen & Griffen (1978) examined the entire labor force of Poughkeepsie, 
utilizing both manuscript census and city directory data. They estimate that 
self-employment accounted for the bulk of middle-class workers at mid-cen- 
tury, and that 17% of all working males were self-employed. In contrast, 
clerical workers formed a smaller proportion of the male labor force: only 
about 4 to 1196 throughout the period examined. In Detroit in 1880, the 
self-employed (excluding self-employed professionals) accounted for about 
12% of the total labor force (Archer 1991a). Decker’s (1978) estimates for 
San Francisco in 1880 are comparable: he found all segments of the 
self-employment to comprise about 1496 of the total labor force. 

Industrialization expanded opportunities for ownership of retail shops and 
some types of business proprietorships, which increased the rate of upward 
mobility (Griffen & Griffen 1978, 1980, Decker 1978). This helped to create 
middle-class social networks within emerging commercial districts (Blumin 
1989, Conzen & Conzen 1979, Muller 1987, Porter & Livesay 1971, Ward 
1971). Concurrently, retailers and small businesses became aligned with a 
national culture of business institutions and practices, such as banking and 
credit, as they emerged with the growth of corporate capitalism (Cochran 
1961, Degler 1967, Kirkland 1971). 

The expansion of opportunities for retailing also meant that the occupational 
stability of retailers was less dependent on traditional sources of social and 
economic support, such as family enterprise (Archer 1991b). The skills used 
by retailers were less circumscribed than those of self-employed artisans, 
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which meant that retailers could more easily shift among types of business 
enterprise or enter the white-collar world of clerical and managerial employ- 
ment. 

Entrepreneurship also emerged in the late nineteenth century as the central 
theme of a middle-class ideology of success (Berthoff 1980, Bruchey 1990, 
Burns 1976, Cawelti 1965, Pessen 1977, Robertson 1973, Weiss 1969). The 
cultural ideal of entrepreneurship has continued to reassert itself when, for 
example, in the critique of bureaucratization in the twentieth century the work 
and lifestyles of white-collar employees are contrasted with an idealized image 
of those of small entrepreneurs (Bell 1973, Mills 1951, Reisman 1961, Slater 
1976, Whyte 1956). 


THE EXPANSION OF NONMANUAL EMPLOYMENT The most visible change in 
the nineteenth century that signified a major shift in the occupational and 
social composition of the middle class was the ascendence of clerical and 
managerial work and the proliferation of workplaces that hired nonmanual 
workers (Aron 1987, Bell 1973, Benson 1986, Chandler 1977, Davies 1982, 
Mills 1951). While artisans-proprietors and small business owners certainly 
did not disappear in the final decades of the nineteenth century, they comprised 
an increasingly small segment of the middle class as clerical work expanded. 

Decker (1978) found that, in San Francisco, the number of clerical and 
managerial workers—as well as the number of manufacturers—doubled 
between 1852 and 1880. Overall, white-collar workers comprised 36% of the 
labor force in 1852 but 44% by 1880; merchants were 14% of the labor force 
in 1852 but only 8% in 1880. This signified not only industrialization but the 
growth of managerial capitalism (Chandler 1977, 1990, Cochran 1961, Degler 
1967, Jacoby 1985, Kirkland 1971). Zunz (1990) examined the social origins 
of clerical and managerial workers in the major industries of Detroit to explore 
the crucial role of the expanding white-collar labor force in the development 
of a corporate economy. 


Wealth and Social Mobility 


Occupational shifting was common in the nineteenth century, albeit often 
within the same social class stratum (Katz 1975, Thernstrom 1973, Griffen 
& Griffen 1978). However, middle-class workers were characterized by 
greater occupational stability than manual workers, and this was true for 
nonmanual wage earners such as clerks and salesmen, as well as the 
self-employed. 

In addition, toward the end of the century there were more individuals 
moving into the middle class than there were leaving it. Again, this was due 
to the expansion of nonmanual wage employment. Career mobility studies 
have consistently shown that, in the last third of the nineteenth century, 
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“climbers” predominated over “skidders”: movement from manual to non- 
manual work was more common than the reverse. Thernstrom’s (1973) study 
of Boston, using data coded from the manuscript census schedules, estimates 
the likelihood of upward career mobility among male workers to have been 
about one in four for manual-to-nonmanual work; but only about one in six 
for nonmanual-to-manual work. This pattern has been supported in studies of 
Omaha (Chudacoff 1972), Poughkeepsie (Griffen & Griffen 1978), San 
Francisco (1980), and in Pennsylvania in the early twentieth century (Gold- 
stein 1958). Once middle-class status was attained, it was likely to have been 
retained: even with job shifts, 70% to 80% of middle-class workers 1n both 
Boston and Poughkeepsie between 1880 and 1900 remained in the middle 
class. 


THE ROLE OF CULTURE 


The general thesis of cultural historians is that high rates of upward mobility 
combined with accessible codes of middle-class culture helped to shape a 
common class identity throughout the period of post-bellum America. In 
Raymond Williams' (1961) terms, a major component of the nineteenth 
century was the emergence of a new "structure of feeling" that was both the 
expression and the legitimization of middle-class behavior and ideals, and 
that was requisite to industrialization and nation building. This "structure of 
feeling” was also indispensable for social control under rapidly changing 
conditions. 

The importance attached to middle-class culture is hard to reconcile with 
prevailing social and economic conditions. The period was marked by high 
rates of immigration (Lieberson 1980), disruptive social change and consid- 
erable poverty (Rus [1890] 1957), exploitation of workers and appalling 
industrial conditions (Gutman 1966), and a high degree of residential 
segregation that reflected ethnic antagonisms (Katznelson 1981, Hershberg 
1981). Moreover, the nineteenth century exhibited considerable economic 
inequalities. Estimates for northern US cities suggest that around 1870, the 
top 0.1% had between 15% to 24% of the wealth, and over 50% of the 
population had no real or personal wealth (Soltow 1969, Gallman 1969). 

It is, therefore, surprising, that instead of increasing polarization, a genuine 
shared culture was shaped by what Blumin (1989) termed the “middlin folk.” 
From the congeries of occupations that comprised the growing urban middle 
class emerged a life style, set of cultural codes, behaviors, and conventions 
that helped to define an increasingly homogeneous civic order. Part of the 
explanation for the expansion of middle-class culture was an improving 
standard of living. Wages (in real dollars) increased about 5096 between 1860 
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and 1890 (Long 1960, Lebergott 1984). This created msing expectations, an 
eagerness to participate in urban culture, and a genuine expansion in 
opportunities (Burns 1985). The other part of the explanation relates to the 
ease of acquiring cultural codes of middle-class behavior and life style, and 
the loose fit between affluence and the wherewithal to ape middle-class 
decorum in public places. A modest civic life, void of aristocratic conditions 
(Tocqueville [1836] 1969, p. 266) contributed in no small measure to the 
success of middle-class culture. 

As Kammen describes it, "[conditions] in America meant that indications 
of inequality would be less visible than in Europe" (1980, p. 277). A main 
mechanism, it is argued here, was the proliferation of middle-class institutions 
that were widely accessible to the rich and poor, and the high—and often 
realistic—expectations for mobility, if not for oneself, then at least for one's 
children. A component of this story is that urban elites were eager to justify 
themselves to the masses (Bender 1987, p. 173), if not to edify and uplift 
cultural tastes (DiMaggio 1982, Blau 1991). Another mechanism that made 
possible a viable middle-class culture was a deep-rooted commitment to 
simplicity in American life (Shi 1985), which was linked in turn to the highly 
practical and anti-elitist approach to lifestyle and consumption. The fact that 
practicality and democracy pervaded education as well contributed to the ease 
Of access of people with humble origins to middle-class status (Cremin 1988). 
Thus, one of the major reasons that middle-class values, life-style, and 
manners were so successfully diffused throughout the urban population was 
the distrust of elite traditions, which accompanied a pervasive repudiation of 
the past (Kammen 1978, pp. 221—229). Social origins mattered relatively little 
when there was so much diversity, and respectability could be traced no more 
than one or two generations. 

Under conditions of rapid upward social mobility that characterized the 
post-bellum period in northern cities, defining class membership was serious 
business, requiring cultural edgeworking that drew from lifestyle, residence, 
and affiliations. Yet, at the same time there was a profound ambivalence about 
drawing boundaries too rigidly in a growing economy. 

The popular drive for refinement in the years after the Civil War is 
manifestly evident in literary monthlies, etiquette books, and household 
manuals. Newspapers and public oratory were increasingly cast in popular, 
and even "vulgar" language, and there was considerable slippage between 
language and class. As Cmiel observes (1990, p. 200) “One did not speak in 
a certain way because one belonged to a particular socioeconomic class. One 
found a place in the social order by one's cultural attainments." Historian 
Kasson (1990), building on the work of Norbert Elias (1978), draws attention 
to the ease with which it was possible to learn manners, the control of 
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emotions, and rituals of polite behavior. The codes of civility helped to 
integrate vast numbers of rural to urban immigrants and, most especially, 
foreign immigrants into an amorphous middle class. 

Paradoxically, as the century wore on, a tension developed within the 
middle class that would have future significance. On the one hand, middle- 
class culture was rooted in locale—the city, the suburbs, and its institutions— 
but successful middle-class occupations were increasingly dependent on the 
national integration of the economy and professions (Hall 1984, Wiebe 1967, 
pp. 111-132, Bledstein 1976, Larson 1977). Accompanying the decline of 
the meaning of locale, the end of the century brought a moral uncertainty, 
which, according to Lears (1981, 1983) can be traced to the increasing 
vagueness and irrelevance of liberal Protestantism, and a sense of ennui that 
accompanied a recognition of the increasing consolidation of wealth and the 
power of national business and bureaucratic elites. 

Veblen ([1899] 1953) had long before drawn attention to the pecuniary 
traits and sheer irresponsibility of the very rich, but did not, at the time he 
wrote, fully realize the significance of industrial wealth for a national system 
of privilege and status, and of the institutional linkages that integrated various 
elites. Drawing attention to this transcendent consolidation, Lears (1981) 
points out that instead of opposing the pecuniary values of the very rich, 
members of the middle class (now disengaged from deep religious involve- 
ment) immersed themselves in imitation of the rich. Consumerism became a 
means of status competition and a therapeutic quest for self-fulfillment and 
gratification. Others (Allen 1983; see also, Haskell 1984) have explored how 
artists, scientists, intellectuals, and educators accommodated themselves to 
the status politics of acquisition and material accumulation. 

This shift from the importance of moral worth to social and economic value 
refocused the motivations of members of the middle class but may have left 
them vulnerable to the social and economic transformations of the late 
twentieth century. We return to this argument in a later section, after 
describing the distinctive ways that urban and rural places helped to fashion 
comparable moral orders in which enterprise was linked with equalitarian and . 
civic values. 


Urban Institutions 


The spirit of volunteerism swept American cities in the nineteenth century. 
Initially fueled by religious revivals in the 1820s, the first generation of 
volunteers were drawn to charity in a very personal and direct way (McCarthy 
1982; see also, Miller 1966, Horowitz 1976). However, increasingly through- 
out the century, those who became wealthy withdrew from direct community 
involvement and were replaced by a corps of middle-class professionals and 
bureaucrats who standardized charitable services (McCarthy 1982). The 
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history of volunteerism and its transformation highlights several things about 
the development of the middle class and its relation to the urban poor. First, 
it reveals the changing nature of the relationship of the middle class to the 
nonprofit sector, from the doers in the early 1820s, to the technocrats of the 
early twentieth century. Second, women’s sustained involvement in volunteer 
work gave them power outside the domestic realm, which undoubtedly helped 
to increase labor force participation—although not for the very rich (see 
Daniels 1988)—and provided momentum to the suffragette movement 
(DuBois 1987). Third, charity and volunteerism were embedded in contradic- 
tory motivations: social reform and amelioration as well as social control 
(Minihan 1977, Sutton 1991, Schuyler 1986). 

Other urban institutions, besides volunteerism, played an important, 
although a less obvious, role in diluting the ethnic and religious identities of 
new immigrants and in diffusing middle-class cultural codes. These included 
the department store, vaudeville, public theaters, newspapers, new forms of 
city transit, city parks, spectator sports, city festivals, traveling exhibits, and, 
later, the cinema (Ratzel [1876] 1988, Kasson 1978, Barth 1980, Ryan 1989, 
Kasson 1978). Overproduction and the ready availability of inexpensive 
clothing and housewares were especially important late in the century, as 
these factors contributed to widespread consumerism and the diffusion of 
middle-class display, both authentic and derivative (Orvell 1989). 


Rural America 


The success of organized philanthropy, consumer products, and leisure 
institutions in modulating diversity in cities was less applicable to rural 
populations, for which lower densities and relative isolation impaired easy 
social intercourse. The credit for forging a relative homogeneous moral and 
civic order linked to the values of hard work and individualism in predomi- 
nantly rural areas belongs to the churches and the schools. While denomina- 
tional rivalries simmered in rural, frontier Americe, community living 
depended on ecumenical cooperation and agreement about overarching 
Anglo-Protestant norms (Marty 1986, p. 150). 

The extraordinary success of the common-school movement, according to 
Tyack & Hansot (1982), was due to Protestant activists who successfully 
mixed arguments for public schooling with evangelical messages, a Republi- 
can ideology, and the efficacy of an ethic of work and discipline. By 1890, 
rural schools enrolled over 70% of the eligible population, and given the 
conditions of sparse settlement, schools were integrated by religion, ethnicity, 
class, and gender—a condition that hastened the cultural homogenization of 
rural populations. The automobile and the telephone were also important in 
bringing rural families into the orbit of urban life (Fischer 1992), and in 
facilitating the suburbanization of the urban middle-class. 
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Race, Religion, and Ethnicity 


There is no question that the general lines of this argument must be qualified, 
as the forces of class formation were not uniform over time or across 
geographical regions, and nonwhites and non-Protestants were often viewed 
as outsiders. Ethnic antagonisms increased during the late nineteenth and early 
twentieth century, and while nativism movements were driven by ethnicity 
and religion, the socioeconomic status of participants played a major role 
(Beisel 1990, Rosenzweig 1983). The consolidation of religion and ethnicity 
had far-reaching significance and helped to shape the contours of political life 
and the lines of social cleavage (Kleppner 1979, Olzak & West 1991, Blau 
et al 1993). The significance of race and ethnicity for class differences is 
beyond the scope of this paper. Suffice it to say, whereas white ethnics 
achieved rapid assimilation (Tyree 1991), Blacks did not. 

The earliest studies of the Black middle class report distinctively high 
degrees of commitment to education, strong familism, and a great emphasis 
on conventional values and standards of conduct (Frazier 1932, Dollard 1957, 
Drake & Cayton 1962). While these early researchers, particularly Frazier, 
criticized the members of the black bourgeoisie for their low self-esteem and 
too-eager accommodation to whites, more recent studies report far more 
positive racial attitudes (Kronus 1971). Yet, as Wilson (1987) suggests, 
members of the black middle class are still torn between the pressures of racial 
and class loyalties. 


THE TWENTIETH CENTURY TRAJECTORY OF THE 
MIDDLE CLASS 


The end of the Depression brought declining economic inequalities, the 
expansion of education, and further consolidation of the middle class and its 
culture of consumption. In fact, there was often denial of blue-collar working 
class membership after the Depression (Lynd & Lynd 1936), and studies show 
that blue-collar workers continued through the twentieth century to embrace 
middle-class identities, at least outside the work place (Halle 1984). What are 
these central identities and values? One view is that these now encompass the 
following: individualism and achievement, privacy, familism, consumerism, 
and conventionality (Bell 1976, Bellah et al, 1985, Gans 1988). Interestingly, 
scholars of the nineteenth-century middle class (for example, Williams 1961) 
stress instead the importance of growing tolerance, and equalitarian and 
collectivist values. 

Of increasing importance to middle-class culture was the cult of domestic- 
ity, which in part meant the centrality of the home for consumption and 
child-rearing (Trachtenberg 1982), a phenomenon which easily accommo- 
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dated itself to suburbanization (Wright 1981). But the cult of domesticity, 
and the suburban pattern of breadwinning male and home-bound homemaker 
in turn slowed the process of women’s integration in labor markets, making 
subsequent transitions in the family fraught with deep dilemmas for women 
and wide-spread social unease about the concept of the American family 
(Gerson 1991). 

The structural changes in the economy beginning in the 1970s (Harrison & 
Bluestone, 1988) entailed precarious conditions, notably increasing economic 
inequalities, downward mobility, the entrapment of the poor in the inner city 
(Lynn & McGeary 1990), and declining wages and productivity (Kuttner 
1984), thus undercutting the certainties of the middle class (Newman 1988, 
Blau 1993). Achievement values were called into question as the fastest 
growing segment of the service economy paid low wages, and education failed 
to translate into good jobs (Harvey 1989, Sullivan 1989). Norms about 
respectability and conventionality are put severely to the test in a society of 
cultural diversity and under conditions of decline (Wolie 1989). 

Whether middle-class discontent erupts as racism, religious fundamental- 
ism, or political conservatism is highly speculative, but it is clear that without 
economic recovery the cultural moorings and thé best tenets of the middle- 
class culture—tolerance, egalitarianism—will be undercut by mean spirited 
versions of other middle-class values—individualism (anti-collectivism) and 
conventionality (moral self-righteousness). 


SUMMARY 


Macrosociological theories that have emphasized the importance of immigra- 
tion, urbanization, and industrial development during the nineteenth century 
have neglected the emergence and transformation of the middle class in the 
United States. On the other hand, the great variety of quantitative case studies 
of individual cities that were carried out under the aegis of the “new” social 
history provide evidence that a middle class can be traced to the early decades 
of the century, and, indeed, played an important role in urban social life. 
While the community-level studies of social and cultural historians have 
illuminated the complexity of the social processes which led to the emergence 
of a distinct-——albeit shifting——middle class; taken as a whole, these studies 
do not provide a coherent understanding of its emergence. This is because 
each of these studies is focused on a relatively short period of time, a 
circumscribed geographic area, and, therefore, each captures particular phases 
in what was a dynamic and complex process. 

The thesis argued here is that the social composition of the middle class 
varied over time. Skilled artisans and merchants comprised the earliest middle 
class. Over the course of the century, small business owners such as retailers 
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and the petite bourgeoisie played a more important role, and at the end of the 
century, with the development of managerial capitalism, the middle class 
encompassed a more diverse grouping that included clerical, technical, and 
managerial workers as well as entrepreneurs and retailers. 

Because of both high rates of geographic mobility, which fostered inter- 
occupational and inter-ethnic contact, and improving economic conditions and 
upward mobility, a distinctive middle-class culture emerged, evident in social 
institutions such as schools, newspapers, leisure activities, and the family. 
These social institutions helped to shape—and were shaped by—codes of 
etiquette, public behavior, manners, and distinctive middle-class values. The 
importance of these institutions and the new urban culture for modulating the 
great differences among occupational, ethnic, and religious groups should not 
be underestimated. At the same time, the nationalization of middle-class 
culture and institutions in the early decades of the twentieth century under- 
mined its place-based character and, perhaps, the significant role of the middle 
class in particular cities. 

Structural changes in the economy in the 1970s have had considerable 
impact on the composition, character, and economic well-being of the 
American middle class and contributed to the growth of an urban underclass. 
Without fully understanding the implications of this ongoing process, we are 
only able to speculate about the possible outcomes of middle-class discontent 
and the relations between the relatively affluent and the urban poor. 
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Abstract 


A realist versus nominalist debate within the field of international migration 
questions whether refugees are fundamentally distinct from immigrants or 
whether the category is a social construction masking similarities with immi- 
grants. Contemporary refugee and immigrant flows conform to patterns of the 
world system. However, refugee migrations are caused by changes in the 
nation-state. Like immigrants, refugees organize migration through social net- 
works, but the composition of their networks and the effects of migration on 
social identity differ. In a host society, both populations adapt with household 
economic strategies that secure multiple income sources, although the state 
plays a greater role in the adaptation of refugees. The partial convergence of 
two migration forms once presumed opposite reveals general patterns in 
international migration and adaptation, supporting the nominalist perspective. 
The remaining differences suggest that refugees are primarily distinguished by 
their relationship to the state, thus supporting the realist perspective. 


INTRODUCTION 


Since the 1970s, refugees have posed a growing problem in the Third World 
and formed an increasing proportion of the migrants arriving to the First 
World. The Social Science Index listed an annual average of 15 journal articles 
on refugees from 1970 to 1974, 40 between 1975 and 1979, and over 80 
during the 1980s. 

Research on refugees accumulated with minimal conceptual elaboration: 
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Immigrants constituted an economic form of migration, refugees a political 
form. This simple but intuitive dichotomy, coupled with dramatic media 
coverage of refugee crises, sheltered refugee studies for some time. Soon, 
however, the literature faced the charge that “refugee” is simply a bureaucratic 
label applied by states for political motives, rather than a sociological category 
demarcating discrete groups and behaviors. As a result of this criticism, the 
two perspectives lead an uneasy coexistence within the field of international 
migration. One perspective views violence, flight, and exile as definitive of 
the refugee experience, the other considers “refugee” a social construction. 

These realist and nominalist perspectives take different forms depending 
on the aspect of the refugee process being examined: global migration patterns, 
the causes of refugee crises, the social organization of migration, and 
adaptation in a host society. Conceptualizing refugees’ relationship to the state 
is critical to evaluating the two perspectives, and thus to determining in what 
ways refugees are distinct from immigrants. 


REFUGEES IN GLOBAL PERSPECTIVE 


In 1951, the United Nations defined “refugee” as a person who “owing to a 
well founded fear of persecution for reasons of race, religion, nationality, 
membership of a particular social group or political opinion, is outside the 
country of his nationality and is unable or, owing to such fear, is unwilling 
to avail himself of that country.” For government officials, the key clause in 
this definition is “well founded fear of persecution.” For social scientists, the 
significant fact about refugees 1s that they break ties with their home state 
and seek protection from a host state through migration. If they cross an 
international border they become refugees, if they remain within their 
homeland they are displaced persons. 

The impetus for UN action in 1951 was the reconstruction of Europe in the 
aftermath of World War Two, particularly the 7 million Europeans unable to 
return to their homelands (Marrus 1985). Until amended in 1967 the resulting 
convention did not cover refugees in the rest of the world. During the interim, 
the foreign policy significance of refugees for developing countries and their 
numerical importance in developing countries increased (Zolberg et al 1989), 
signaling the mix of political and demographic factors that gave rise to 
nominalist and realist perspectives, respectively. 


The World System 


The world system perspective makes few distinctions between refugees and 
immigrants (Portes & Bach 1985). Migration from the core to the periphery, 
as occurred with European settler societies in North America (Bailyn 1986), 
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and immigration within the core, as in Europe following both World Wars 
(Marrus 1985), both reveal a mix of economic and political flows. Migration 
from the periphery to the core also includes both refugees and immigrants. 
Between 1975 and 1990, over 2.5 million refugees from Asia, Africa, and 
the Middle East gained residence in the First World (US Committee on 
Refugees 1992). France, Sweden, and Germany are the leading European 
host countries, and refugee arrivals began just after the ending of labor 
immigration in 1973-74. Refugees are part of continuing immigration flows 
in Australia, Canada, and the United States; Asia and Latin America have 
replaced Europe as the source of refugee admissions, paralleling changes in 
immigrant admissions (see Table 1). 

During the 1970s and 1980s, most refugee migrations occurred within the 
periphery. The Third World shelters 96% of the world’s 17.5 million refugees, 
suggesting that producing or accepting large numbers of refugees may be a 
component of peripheral status (see Table 2). Economic development and 
assistance to refugees are inseparable issues (Gorman 1986, Harrell-Bond 
1986, Keely 1981) because the “refugee” is an indicator of world system 
dynamics. 


Table 1 Refugee admissions to the United. States, 1945-1991 
Sources, Dfaz-Briquets & Pérez 1981, Refugee Reports 1989, 1991, 
US Congressional Research Service 1980, US ORR 1991 


Number 
Region and principal countnes (1000s) 
Europe 1,149 
Displaced persons, 1948—52* 352 
Hungary, 1956 —58 30 
Soviet Union, 1975—91 258 
Asia 1,091 
Vietnam, 1975-91 655 
Laos, 1975-91 223 
Cambodia, 1975 91 147 
Caribbean and Latin America 960 
Cuba, 1959—73 634 
Near East and South Asia 70 
Iran, 1980—91 35 
Afghanistan, 1980—91 29 
Africa 38 
Ethiopia, 1980-9] 27 
Total 3,308 
1945 —74 1,761 
1975 91 1,547 


* Primarily from Germany, Poland, and the Soviet Union 
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Table2 Refugees in the world system, 1991. Source: Europa Publications 1992, US Department 
of State 1992, World Bank 1990 


Principal countries GNP per capita Population Number of refugees Major refugee 


of first asylum (in $)* per refugee^ (1000s) population 
AFRICA 

Burundi 220 20 270 Rowanda 

Cote d’Ivoire 790 42 230 Liberia 

Ethiopia 120 100 492 Somalia 

Guinea 430 9 548 Liberia 

Malawi 160 8 982 Mozambique 

Sudan 340 28 729 Ethiopia 

Tanzania 160 92 260 Burundi 

Zaire 170 78 483 Angola 
EAST ASIA 

Thailand 1000 111 498 Cambodia 
NEAR EAST 

Iran 3200 14 3,620 Afghanistan 

Jordan 1240 4 980 Palestinians 

Lebanon 241 9 313 Palestinians 

Syria 1020 44 204 Palestinians 
SOUTH ASIA 

Pakistan 350 24 3,502 Afghanistan 
CENTRAL AMERICA 

Costa Rica 1790 85 35 Nicaragua 

Belize 1600 6 30 El Salvador 


* GNP per capita high income OECD countnes. 17,460, low income countries: 320 
* US population per refugee (total admissions 1975-1991), 161 


The Nation-State System 


The world system perspective cannot explain how the historical process of 
state formation and breakdown creates refugees in particular countries 
(Zolberg 1981). The world population is increasingly divided into citizens 
protected by their state and people who are not members of a functioning 
nation-state. The Kurds of Iraq and the Muslims of Bosnia are among a 
growing population of displaced persons who remain within their home state 
but are either persecuted or not protected by it. 

The increased salience of international institutions in protecting refugees is 
an indicator of the weakness of the nation-state system. UN refugee relief 
expenditures and the number of interventions in Asia and Africa rose 
dramatically beginning in the 1970s (Pitterman 1987). The United Nations 
hopes to supervise the repatriation of 3 million people to 21 countries, while 
continuing to assist 620,000 refugees who have already returned to their 
homelands (Refugees 1992). In countries like Somalia, relief organizations 
provide more services than the central government, which in some cases has 
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ceased to exist. Since 1948, the United Nations has funded social services 
for millions of Palestinians in the Gaza Strip, the West Bank, Jordan, and 
Syria. 

From these successive crises developed “the international refugee system” 
(Gallagher 1989), a network of national, international, and UN relief agencies 
(Gordenker 1983). It is no coincidence that an organization functioning as a 
global state is forming to manage refugee problems. The very concept of 
“migration” emerged from the growth of national bureaucracies (Dowty 1987). 
Bringing the state into refugee studies, particularly to explain the origins of 
refugee crises, provides a realist perspective on refugees. 


CAUSES OF REFUGEE CRISES 


Early studies believed that refugees and immigrants had different motives for 
leaving a homeland (Fagen et al 1968, Kelly 1977). Contemporary research 
on Salvadorans continues in this vein. There is evidence that Salvadoran flight 
results from political violence (Montes 1988, Stanley 1987), but also 
economic crisis (Jones 1989) and family reunification (Chavez et al 1990). 
From a nominalist perspective, this approach overemphasizes individual 
psychology and neglects the state interests served by distinguishing political 
from economic migrants (Bach 1987). 

Migration typologies reintroduced the realist perspective on refugees. 
Populations likely to flee political persecution include ethnic minorities, ~ 
armies engaged in civil war, and elites with ties to foreign powers (Kunz 
1981, Richmond 1988). Yet such typologies are still founded on the 
problematic distinction between economic and political migrants. Political 
conditions can cause migration when they result in deteriorating economic 
conditions, such as occurred with dictatorial looting in Haiti (Stepick 1987) 
and state dependent development in the Dominican Republic (Grasmuck & 
Pessar 1991). 

Analyzing the globalization of social conflict to explain forms of political 
violence is the most recent innovation ın refugee theory (Zolberg 1981). This 
approach has the value of making sense of economically irrational cases of 
middle class expulsion, such as the Jews from Spain in the 1490s and ethnic 
Chinese from Vietnam in the 1970s. Because the political factors in refugee 
crises are now better understood, analysis focuses on the politics of obtaining 
protection from one's native state or residence in a host state, thus blending 
realist and nominalist perspectives. 


Political Violence 


From World War Two to the end of the Cold War, decolonization and 
superpower conflict produced the largest number of refugees (Ferris 1987, 


48 HEIN 


Gordenker 1987, Zolberg et al 1989). Violence often accompanied indepen- 
dence from a European power, but conflict continued as contending nation- 
alities and tribes sought control of the state apparatus. Political realignments 
also resulted from the former Soviet Union’s intervention in Eastern Europe 
and Central Asia. The United States caused political instability in Southeast 
Asia and Latin America by supporting allied regimes and subverting the 
political process. 

The global social conflict perspective was developed before the end of the 
Cold War world and yet accounts well for recent refugee crises. The potential 
for large numbers of refugees in the former Soviet Union and its neighbors 
follows the pattern of nation-state breakdown. The displacement of 5 million 
people during the Iraqi invasion of Kuwait and the ensuing Gulf War combines 
the major causes of refugee flows. Between August 1990 and April 1991, 
950,000 immigrant workers from Jordan and Egypt and 360,000 from Asia 
fled the fighting; Saudi Arabia expelled 1 million Yeminis because of their 
government's support of Iraq; failed rebellion led 1.4 million Kurds to enter 
Turkey and Iran (US Department of State 1992). But the Iraqi crisis was novel 
for the rapidity and geographic extent of the refugee flows it produced, 
signaling the fragility of the national state as global interdependence increases. 
Examining the birth, death, and relationships among nation-states supports a 
realist interpretation of refugee crises. 


The Politics of Migration Policy 


The three solutions to refugee flight are repatriation, integration within the 
country of first asylum, and resettlement in a third country (Stein 1986). 
During the 1980s, most refugees simply remained in temporary asylum in 
countries adjacent to their homeland. Despite a constant flow of refugees to 
the West since World War Two, resettlement has been the least used solution. 
The transfer of 1.7 million Indochinese refugees to Western Europe, North 
America, and Australia is the most extensively organized global diaspora 
(Bach 1989). Family reunification and refugee status are now the primary 
means available to migrants hoping to settle in the West (Salt 1989). During 
the 1980s, Australia, Canada, and the United States admitted one refugee for 
every five immigrants. In 1990, one asylum seeker (migrant seeking refugee 
status) arrived to the United Kingdom for every two immigrants; the ratio was 
equal in France; and asylum seekers exceeded immigrants in Sweden. 
Germany received 700 asylum seekers a day during 1991 (Organization for 
Economic Cooperation and Development 1992). 

Foreign policy, domestic pressure groups, and fiscal concerns shape the 
state's selection of displaced populations for admission as refugees (Zucker 
& Zucker 1989). The influence of foreign policy on US refugee admissions 
is well documented (Loescher & Scanlan 1986, Tucker et al 1990, Zucker & 


REFUGEES, IMMIGRANTS, AND THE STATE 49 


Zucker 1987). Although states intervene to assure supplies of labor, refugees 
serve political rather than economic interests (Charlton et al 1988, Mitchell 
1989, Suhrke & Klink 1988). Analyzing refugee admission policies estab- 
lishes that refugee crises are caused in part by the social construction of who 
is and is not a “refugee.” 


THE SOCIAL ORGANIZATION OF REFUGEE 
MIGRATIONS 


The push-pull theory of international migration, which distinguishes between 
planned immigrant and spontaneous refugee movements, is the oldest of the 
realist perspectives on refugee migrations. In fact, refugees share some 
Migratory experiences with immigrants, particularly prior migration and 
reliance on social networks. However, the composition of immigrant and 
refugee networks differs, as does the role of migration in their social identities. 


Prior Migration and Social Networks 


Internal migration prior to international migration is common among refugees 
and immigrants. During the 1980s, indigenous Nicaraguans followed tradi- 
tional migration paths to sanctuary with sister communities in Honduras 
(Zinser 1991); rural Salvadorans migrated to urban areas before resorting to 
international flight (Mozo & Vasquez 1988). About 60% of Vietnamese 
refugees in the United States migrated at least once within Vietnam (Desbarats 
1986), and one third of those migrations were due to political violence 
(Dunning 1989). Indochinese refugees in the United States commonly engage 
in "secondary migration": moving from areas of initial settlement to states 
with large refugee communities, job opportunities, and higher public assis- 
tance payments (Desbarats 1985). Refugee flight from a homeland is usually 
preceded and followed by other migration forms. 

Refugees also use kin and friendship networks to navigate their passage to 
a host society just as immigrants do. Relatives and friends persuaded about 
one in five Indochinese refugees in Thailand to migrate (Institute Asian Studies 
1988). Relatives sponsored two thirds of Indochinese refugees to the United 
States only seven years after the first wave's arrival (US General Accounting 
Office 1983). Cubans arriving to the United States during the 1960s averaged 
eight relatives and friends already in the country (Portes & Bach 1985). This 
evidence on internal migration and social networks suggests a convergence 
of migration forms once presumed opposite by the realist perspective. 


The Direction and Composition of Refugee Networks 


Refugees permanently resettled abroad usually cannot return to their home- 
land, a definitive difference from immigrants. Circular migration among 
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Dominicans (Grasmuck & Pessar 1991), Mexicans (Massey et al 1987), and 
Puerto Ricans (Nelson & Tienda 1985) supports ethnic resilience, greater 
equality in gender roles, and a degree of development in the sending society. 
Naturalization rates increase when a migrants’ homeland is further away and 
socialist (Portes & Mozo 1985), two factors more closely associated with 
refugees than immigrants. Further research into circular versus linear migra- 
tion is likely to yield important insights into the behavioral consequences of 
migration patterns. 

Repatriation is an option for refugees who remain in countries of first 
asylum. Returning home after residing abroad is increasingly the only choice 
for many of the world’s refugees (Cuny & Stein 1989). The repatriation of 
Guatemalan, Nicaraguan, and Salvadoran refugees required international 
agreements, amelioration of conditions in the homeland, and declining 
prospects for integration in the countries of first asylum (Larkin et al 1991). 
These cases document circular migration among refugees, although interna- 
tional agencies, the home government, and community associations formed 
by the refugees themselves organized the migration. 

The demography of immigrant and refugee flows also differs, in part 
because “refugee populations include many persons who would not leave 
home on a voluntary basis” (Gold 1992:17). In addition, structural factors 
explain why refugee migrations are more frequently composed of families, 
while immigrant migrations are more often composed of individuals (Boyd 
1989). As evidenced by Mexicans and Cubans (Portes & Bach 1985), the 
family preference system of US immigration policy causes serial kin migra- 
tion, while refugee admissions are based on broad political and nationality 
categories that facilitate group migration. The political causes of refugee crises 
also produce distinct waves of migrants with similar departure dates and social 
characteristics (i.e. migration cohorts). The Cuban migration to the United 
States began with white and middle-class migrants and ended with a much 
larger proportion of black and working class migrants (Boswell & Curtis 
1984). Government and military officials comprised the first wave of 
Vietnamese refugees to the United States; later waves included more ethnic 
Chinese and unskilled workers (Gordon 1988, Nguyen & Henkin 1982). The 
consequences of cohort migration range from advances in American arts and 
sciences following the arrival of Central European refugees in the 1930s (Coser 
1984) to the political dominance of a conservative Cuban elite in Miami 
(Forment 1989). The directional and demographic differences between refugee 
and immigrant networks suggest the continued validity of a modified version 
of the realist perspective. 

The state also creates cohorts among immigrant flows. From the 1880s to 
the 1980s, cohort migration resulting from political changes in China, and 
alternately restrictive and liberal immigration policies in North America, 
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shaped the development of Chinatowns in the United States (Kwong 1987) 
and Canada (Lai 1988). Each transformation of the Polish state during the 
twentieth century restructured the social organization of the Polish community 
in the United States through the arrival of successive cohorts distinct in their 
social, economic, and political characteristics (Erdmans 1992). States are 
factors in both immigrant and refugee migrations, supporting the nominalist 
perspective, although the type of state intervention can be important for 
subsequent adaptation in the host society. 


Migration and Social Identity 


Tilly (1990:84) summarizes the relationship between the social organization 
of migration and ethnicity as: “networks migrate, categories stay put, and 
networks create new categories.” Yet categories also "create" migration 
because ethnic groups vary in the degree to which “migrant” becomes an 
identity. 

From the realist perspective, exile is a definitive event for refugees and 
has important consequences for social identity (Rose 1993). After founding 
the state of Israel, Jews explicitly conceptualized their refugee policy as 
“in-gathering exiles” (Korazim 1988) Most Palestinians have not experi- 
enced forced migration, yet exile remains a communal experience (Abu- 
Lughod 1988). Tribal Afghanis have an indigenous conception of asylum 
seeker that combines elements of Islamic history with traditional migration 
to kin in time of war (Centlivres & Centlivres-Demont 1988; Edwards 1986). 
For the Hmong of Laos, another stateless people, the term “refugee” has 
positive connotations (Hassoun & Mignot 1983). But for the Vietnamese, 
whose national identity is 2000 years old, refugee means separation from 
the homeland and has negative connotations. The ethnic Chinese of 
Cambodia and Vietnam, who have resisted assimilation for generations, 
prefer the term “overseas Chinese” to “refugee” because the latter implies 
rootlessness (Hassoun & Tan 1986). 

Conditions in the host society also influence the process of identity 
formation. Cubans in Miami identify themselves as “refugee,” “ethnic 
minority,” or “citizen” depending on the degree of nativism (Portes 1984) and 
the categories prioritized by the American political system (Maingot 1986, 
1987). Direct state intervention can change refugees’ conceptions of the family 
and ethnicity (Goldstein 1986, Hein 1991a, Skinner & Hendricks 1979). 
Refugees employed in resettlement agencies become brokers for refugees, or 
middlemen for their organizations, depending on their ethnicity, community 
politics, and the goals of social welfare policy on refugees (Gold 1992, Hein 
1988, Indra 1987). This research supports the realist interpretation that 
refugees are distinct from immigrants because the state remains a factor in 
the social organization of migration and the creation of migration identities. 
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Adaptive responses found among immigrants and refugees, such as household 
strategies to secure multiple income sources, suggest that distinct forms of 
international migration do not necessarily mean distinct modes of adaptation 
in a host society (a nominalist perspective). Nonetheless, refugee admission 
policies have real consequences for refugee adaptation, primarily a special 
status in the social welfare system. Whether or not refugees have a unique 
mode of adaptation depends on how the state is conceptualized. 


Demography, the Household, and Income Sources 


Demographic factors such as age and fertility rates explain much of immigrants 
and refugees' economic adaptation. The variable number of years in the United 
States is the best predictor of welfare dependency among Indochinese refugee 
families; the number of children is the second best (Rumbaut & Weeks 1986). 
Similarly, English proficiency, prior education, work experience, and length 
of residence predict refugees' employment, job status, and income in the 
United States (Haines 1987). Socioeconomic gains also vary by sex (Haines 
1986) and ethnicity (Desbarats 1986, Strand 1984, Strand & Jones 1985). 

Yet a strict demographic analysis often misses the importance of social 
networks in the adaptation process. The household rather than the individual 
is proving to be the best unit of analysis (Bach 1988). Indochinese refugees 
use relatives and friends to solve many social problems even when public 
programs are available (Forbes 1985, Hirayama & Hirayama 1988). But both 
sources of help decline over time, while self-reliance increases (Strand 1989). 
Sponsorship by an American family or resettlement agency, rather than by 
other refugees, increases employment, particularly for women (Bach & 
Carroll-Seguin 1986, Tran 1991). Although social networks interact with other 
factors in the adaptation process, the household remains the basic economic 
unit for refugees as well as immigrants, an important factor in the development 
of ethnic businesses (Gold 1989). 

Immigrant and refugee households also blend income from multiple 
sources, such as the market, informal economy, and means-tested social 
welfare programs, and more frequently include working spouses and other 
adults. This income strategy resembles that of ethnic minorities in the United 
States, but not the majority of natives (Tienda & Jensen 1988). Having wage 
earners and public aid recipients in a household is a crucial transition point 
for Vietnamese and Laotian refugees moving toward full self-sufficiency 
(Caplan et al 1989). Vietnamese women work in the informal economy to 
meet child care needs, and to receive Medicaid, a health care program only 
available to the unemployed, nonelderly poor (Kibria 1989). The adaptive 
flexibility among Indochinese refugees is largely due to women's ability to 
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bridge nuclear kin, extended family, and the ethnic community, on the one 
hand, and market, state, and informal sources of income, on the other (Benson 
1990, Kibria 1990, Rumbaut 1989a). 

Similar findings from research on immigrants (Grasmuck & Pessar 1991, 
Lamphere 1987) support the nominalist perspective on refugees. For example, 
as ethnic communities took root in France during the 1970s and 1980s, both 
immigrants and refugees experienced more problems with the family, life 
course, and state, but greater self-sufficiency in solving problems related to 
the job and housing market (Hein 1992b). There are limitations to using 
concepts from international migration to analyze migrants’ long-term adapta- 
tion in host societies (Hein 1991b). 


Social Welfare Policy 


The primary distinction between immigrant and refugee adaptation is the 
latter’s special status in the social welfare system. In the United States, 
immigrants, but not refugees, are ineligible for public assistance for five years 
after their arrival (North 1983). Refugees who meet the income, but not the 
family composition, criteria for AFDC, can receive Refugee Cash Assistance. 
The federal government reimburses the states for refugees’ public aid and 
medical costs (instead of partial payment) for a period of time, funds social 
services like language and job training programs, and provides a per capita 
grant to voluntary agencies that sponsor refugees (Hein 1992a). Many of these 
policies are currently in flux due to fiscal constraints. 

Whether resettlement policies encourage self-sufficiency or welfare depen- 
dency is the subject of considerable debate (Haines 1988, Refugee Policy 
Group 1983). Even when controlling for demographic factors, Indochinese 
refugees are unique among newcomers in having a public assistance rate above 
that of natives (Jensen 1988). Rates are particularly high in California (Bach 
& Carroll-Seguin 1986). 

The availability of public aid and social services provides refugees with a 
temporary alternative to entering mainstream labor markets and ethnic 
enclaves (Bach 1988, Johnson 1988, Rumbaut 1989b). An extensive study 
concluded that “only 2 percent of the [1384] households in our sample had 
managed to climb out of poverty after six months or less on cash assistance 
and without benefit of specialized refugee programs and services” (Caplan et 
al 1989:215). Indochinese refugees’ poverty and public assistance rates do 
decline with length of time in the United States (Fass 1986). However, it is 
difficult to determine the contribution of job and language training to increased 
self-sufficiency because public aid policies require refugees to register with 
a training program (Haines 1988, Strand & Jones 1985). Determining the 
importance of distinctive social welfare policies for refugees depends on how 
the state is conceptualized. 


54 HEIN 


The Role of the State 


US federal expenditures on Cuban refugee resettlement totaled $1.3 billion 
between 1962 and 1980 (Taft et al 1980). However, the definitive study of 
the Cuban enclave discounts state intervention as an important factor in the 
adaptation process (Portes & Bach 1985). The Cuban resettlement program 
dispersed the refugees, thus inhibiting the development of economic ties. It 
was the unplanned return of Cubans with savings to Miami that increased 
capital formation (Portes 1987). Nonetheless, Cubans’ benefited from state 
assistance, although the gain was not in opening businesses but in professional 
retraining (Pedraza-Bailey 1985). The Cuban case suggests conceptualizing 
the state as one factor in the context of reception (Portes & Böröcz 1989). 

Cross-national research on refugee resettlement indicates a much greater 
role for state intervention. States structure the adaptation of international 
migrants in uniquely national ways, as evidenced by the diversity among 
resettlement programs for refugees throughout the world (Chantavanich & 
Reynolds 1988). Comparison of these programs reveals how states create 
multiple forms of social citizenship by matching types of international 
migrants to types of mghts (Costa-Lascoux 1987, Haines 1989a, Lanphier 
1987, Majka 1991). The historical context of the migration and variation in 
state structure determines if the welfare state or the nation-state becomes the 
core institution managing the incorporation of refugees (Hein 1992a). Context 
of reception takes on a new meaning when the state is conceptualized as the 
host society’s mechanism for mediating the global forces embodied by 
international migration. 

State-society relations also affect the adaptation of immigrants. During the 
early twentieth century, a homeland’s status as colony or independent nation 
influenced Asian immigrants’ treatment in the United States (Chan 1990). 
Contemporary citizenship requirements in Western Europe and North America 
result from the role of nationalism in the founding of the nation-state and the 
prior existence of a colonial empire, among other variables (Brubaker 1989). 
Immigrant associations flourish in countries with a federalized social welfare 
system and a policy of encouraging immigration (Jenkins 1988). Countries 
with few laws licensing self-employment, and national rather than local (or 
nonexistent) policies promoting minority businesses, provide the greatest 
opportunities for immigrant entrepreneurs (Waldinger et al 1990). The state’s 
unique relationship with refugees is a subset of its larger function of 
reproducing the social order. 


CONCLUSION 


At one time, the distinction between economic and political migrants was the 
primary conceptual basis for distinguishing types of international migration. 
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The world system theory provided the first social constructionist critique of 
this realist perspective. The dichotomy is false, 1t suggested, because political 
conditions can cause the economic problems that lead to migration. In 
addition, more than 9096 of contemporary refugees reside in peripheral 
countries in Africa, Asia, and Latin America. The potential for large numbers 
of refugees from new peripheral societies in the former Soviet Union 
substantiates the view that refugees are indicators of status in the world system. 

A theory of global social conflict in the nation-state system supports a 
modified realist perspective. Refugee crises are a consequence of the political 
dynamics of state formation and transformation, and of increasing global 
interdependence, which erodes the stability of national states. Nonetheless, 
analysis of western refugee policy supports the nominalist perspective because 
who is or is not admitted as a refugee remains closely tied to foreign policy 
interests. The simultaneous rise of supranational state groupings and subnatio- 
nal ethnic mobilization requires a rethinking of citizenship, sovereignty, and 
the nation-state. 

Moving from global patterns in refugee flows and crises to the social 
organization of migration, a realist perspective from the push-pull theory 
distinguishes between planned migration by immigrants and spontaneous flight 
by refugees. In fact, both refugees and immigrants use social networks to 
make their passage from homeland to host society. But refugees resettled 
abroad generally cannot engage in circular migration, as immigrants can, 
although repatriation is increasingly the only option for many refugees. 
Refugee migrations more often consist of families and cohorts sharing similar 
social characteristics as a result of departure conditions, while immigrant 
migrations involve the consecutive arrival of individuals and subhousehold 
groupings. Refugees are more likely to view their condition as one of exile 
and to include migration in their social identity. The attitucinal and behavioral 
consequences of these distinctions remain to be explored. Yet a recurring 
finding is that political change and state intervention cause this variation in 
the organization of refugee and immigrant migrations. 

With respect to adaptation in a host society, the nominalist perspective is 
correct that "refugee" is often a remnant of migration rather than a form of 
adjustment to a new environment. The same demographic variables predict 
employment status and earnings for immigrants and refugees. Both popula- 
tions adapt as households and obtain income from multiple sources. Women 
perform the central function of bridging social networks and economic arenas. 
However, state intervention once again produces some important differences, 
particularly of access to the social welfare system. Cross-national research 
reveals distinct patterns in how states structure refugees' incorporation in a 
host society. Refugee status is a relationship to the state that takes a number 
of forms during the process of uprooting, migration, and adaptation. 


56 HEIN 


Literature Cited 


Abu-Lu ,J.U 1988, Palestinians. Current 
Socioi 36(2):61-9 

Bach, R. L. 1987. The Cuban exodus. See 
Levine 1987, pp. 106-30 

Bach, R L. 1988. State intervention m South- 
east Asian refugee resettlement in the U S. 
J. Refugee Stud. 1:38-56 

Bach, R. L. 1989. Third country resettlement 
See Loescher & Monahan 1989, pp. 13-32 

Bach, R. L , Carroll-Seguin, R. 1986 Labor 
is e participation, household composition 

p among Southeast Asian ref- 

a oen Migration Rev. 20:38 1-404 

Bailyn, B. 1986. The Peopling of British North 
America. New York. Knopf 

Benson, J. E. 1990. Household, migration, and 
community context. Urban Anthro. 19:9-29 

Boswell, T D, Curtis, J. R. 1984 The 
Cuban-American Experience. Totowa: Row- 
man Allanheld 

Boyd, M. 1989. Family and personal networks 
in international migration. Int. Migration 
Rey 23:638—70 

Brubaker, W. R., ed 1989 Immigration and 
the Politics of Citizenship in ope a and 
North America. Lanham. One 

Caplan, N., Whitmore, J. K., Gia M F H. 
1989. The Boat People and Achievement 
Ann Arbor Univ. Mich. Press 

Centlivres, P., Centlivres-Demont, M. 1988 
The Afghan refugee in Pakistan J Refugee 
Stud 1:141-52 

Chan, S. 1990. European and Asian immigra- 
tion into the U.S. in co ive - 
tive, 1820s to 1920s See sn Mel asin 
1990, pp. 37-75 

Chantavanich, S., Reynolds, E. B , eds. 1988. 
Indochinese Refugees: Asylum and Resettle- 
ment Bangkok: Inst. Asian Stud, 
Chulalongkorn Univ. 

Charlton, R , Farley, L T , Kaye, R 1988 
Identifying the of U.S refugee 
m asylum policy. J Refugee Stud. 1.237— 


Chavez, L. R , Flores, E T , Lopez-Garza, 
M. 1990. Here today, gone tomorrow? 
Human Org. 49:193—204 

Coser, L A. 1984 Refugee Scholars in Amer- 
ica. New Haven. Yale Univ Press 

Costa-Lascoux, J. 1987. L'insertion sociale des 
réfugiés et demandeurs d'asile en Europe 
Rev Eur Migration Int. 3.151—67 

Cuny, F., Stein, B. 1989. Prospects for and 
promotion of spontaneous repatriation. See 
Loescher & Monahan 1989, pp 293-312 

Desbarats, J. 1985 Indochinese resettlement in 
the U S. Ann. Assoc Am Geog 4:522-38 

Desbarats, J. 1986. Ethnic differences in adap- 
tation. Int. Migration Rev. 20:405—27 

Diaz-Briquets, S., Pérez, L. 1981. Cuba: The 


Demography of Revolution. Washington, 
DC. Pop. Ref. Bur. 

Dowty, A 1987 Closed Borders: The Con- 
temporary Assault on Freedom of Move- 
ment. New Haven: Yale Univ. Press 

Dunning, B. B. 1989. Vietnamese in America 
See Haines 1989b, pp 55-85 

Edwards, D. B. 1986 Marginality and migra- 
tion Int Migration Rev 20°313-28 

Erdmans, M P 1992. Emigres and ethnics 
patterns of cooperation between new and 
established organizations in Chicago's Pol- 
ish community. PhD thesis. Northwestern 
Univ. Evanston, Ill. 

Europa Publications. 1992. Europa World 
Book. London: Europa Pub 

Fagen, R R , Brody, R. A, O'Leary, T. J. 

968 Cubans in Exile. Disaffection and the 
Revolution. Stanford: Stanford Univ. Press 

Fass, S 1986 Innovations ın the struggle for 
self-reliance Int. Migration Rev. 20:351—-80 

Fawcett, J. T., Canno, B V, eds 1988 
Pacific Bridges: The New Immigration from 
Asia and the Pacific Islands Staten Island, 
NY: Center Migration Stud. 

Fems, E G. 1987. The Central American 
Refugees. New York: Praeger 

Forbes, S. S. 1985. Adaptation and Integration 
of Recent Refugees to the U.S. Washington, 
DC. Refugee Policy Group 

Forment, C. A. 1989. Political practice and the 
rise of an ethnic enclave Theory Soc. 19:47— 
81 

Gallagher, D 1989. The evolution of the 
intemational refugee system Int. Migration 
Rev. 23.579-98 

Gold, S. J. 1989. Differential adjustment 
among new immigrant family members. J. 
Contemp. Ethno. 17:408-34 

Gold, S. J. 1992. Refugee Communities: A 
Comparative Field Study. Newbury Park. 


Sage 

Goldstein, B. 1986. Resolving sexual assault 
In The Hmong in Transition, ed G. L. 
Hendncks et al, pp. 135—44. Staten Island, 
NY: Ctr Migration Stud 

Gordenker, L 1983. Refugees in developing 
countries and transnational organization An- 
nals Am. Acad. Polit Soc. Sct. 467:62-77 

Gordenker, L 1987. Refugees in International 
Politics. New York: Columbia Univ Press 

Gordon, L. W. 1988 Southeast Asian migra- 
tion to the U S. See Fawcett & Carino 1988, 

p 153-74 

Gorman, R 1986 Beyond ICARA II. /nt. 
Migration Rev 20 283-98 

Grasmuck, S , Pessar, P R 1991 Between 
Two Islands Dominican International Mi- 
gration. Berkeley Univ. Calif. Press 

Haines, D. W 1986 Vietnamese refugee 


REFUGEES, IMMIGRANTS, AND THE STATE 57 


women in the U.S. labor force. In Interna- 
tional Migration: The Female Experience, 
ed R. J. Simon, C B Brettell, pp 62-75. 
Totowa: Rowman Allanheld 

Haines, D. W. 1987. Patterns of Southeast 
Asian refugee employment Ethnic Groups 
7:39-59 

Haines, D. W. 1988. The pursuit of English 
and self-sufficiency. J Refugee Stud. 1 195— 
213 

Haines, D. W. 1989a Southeast Asian refugee 
resettlement in the U S and Canada Amer- 
asia J. 15:141—56 

Haines, D. W., ed. 1989b. Refugees as Im- 
migrants. Cambodians, Laotians, and 
Vietnamese in America. Totowa: Rowman 
Littlefield 

Harrell-Bond, B. E. 1986. Imposing Aid: 
Emergency Assistance to Refugees. New 
York: Oxford Univ. Press 

Hassoun, J P., Mignot, M. 1983. Le terme 
réfugié dans les langues Hmong et 
Vietnamienne. Asie Sud-Est Monde In- 
sulindien 14(1—2) 7—24 

Hassoun, J P , Tan, Y P. 1986. Les Chinois 
de Paris Terrain 7:34-44 

Hein, J 1988. State incorporation of migrants 
and the reproduction of a middleman minor- 
ity among Indochinese refugees Sociol Q. 
29:463--78 

Hein, J. 1991a. Indochinese refugees' re- 
sponses to resettlement via the social welfare 
system. In Asian Americans; Comparative 
and Global Perspectives, ed S. Hune et al, 
pp. 153-68. Pullman. Wash State Univ. 


Press 

Hein, J 1991b Do "new immi 
"new minonties"? Sociol 
34:61—78 

Hein, J. 1992a States and International Mi- 
grants. The Incorporation of Indochinese 
Refugees in the U.S and France. Boulder. 
Westview 

Hein, J. 1992b. The emergence of ethnic 
minorities 1n France Ethnic Groups 9 135— 
49 

Hirayama, K K., Hirayama, H. 1988 Stress, 
social supports, and adaptational patterns in 
Hmong refugee families. Amerasia J 14: 
93-108 

Indra, D M. 1987 Bureaucratic constraints, 
middlemen, and community organizations 
In Uprooting, Loss, and Adaptation: The 
Resettlement of Indochinese Refugees in 
Canada, ed K B. Chan, D M Indra, pp 
147-70. Ottawa. Canadian Public Heal 
Assoc. 

Institute for Asian Studies 1988. Thailand. A 
First Asylum Country for Indochinese Refu- 
gees. Bangkok Inst. Asian Stud., 
Chulalongkom Univ. 

Jenkins, S., ed. 1988. Ethnic Associations and 
the Welfare State Services to Immigrants in 


grants" become 
Perspectves 


Frye Countries New York: Columbia Univ 


Press 

Jensen, L. I 1988. Patterns of immigration and 
public assistance utilization, 1970-1980 
Int Migration Rev 22:51-83 

Johnson, P J 1988 Tbe impact of ethnic 
communities on the employment of South- 
east Asian refugees. Amerasia J 14 1-22 


Jones, R C 1989. Causes of Salvadoran 
mugration to the U.S. Geograph Rev 
79.183-94 


Keely, C B 1981. Global Refugee Policy: The 
Case for a Development Oriented Strategy. 
New York Pop. Council 

Kelly, G P 1977, From Vietnam to America 
A Chronicle of the Vietnamese Immigration 
to the U S. Boulder: Westview 

Kibria, N. 1989. Patterns of Vietnamese refu- 
gee women's wagework in the U S Ethnic 
Groups 7.291-323 

Kibria, N. 1990 Power, patriarchy, and gen- 
der conflict in the Vietnamese immigrant 
community. Gender Soc 4.9—24 

Korazim, J 1988. Immigrant associations in 
Israel. See Jenkins 1988, pp 155-82 

Kunz, E F. 1981. Exile and resettlement. Int 
Migration Rev 18:42-51 

Kwong, P 1987. The New Chinatown. New 
York: Hill & Wan 

Lai, D C 1988. Chinatowns Towns within 
Cites in Canada Vancouver: Univ. Br 
Columbia Press 

Lamphere, L 1987. From Working Daughters 
to Working Mothers: Immigrant Women in 
a New England Industrial Community. 
Ithica: Cornell Umv Press 

Lanphier, C M. 1987 Indochinese resettle- 
ment. See Rogge 1987, pp. 299—308 

Larkin, M A , Cuny, F C., Stein, B N 1991. 
Repatriation Under Conflict in Central 
America Washington, DC: Ctr. Immigra- 
tion Policy Refugee Assist 

Levine. B B , ed 1987. The Caribbean Exo- 
dus New York: Fraeger 

Loescher, G , Monahan, L , eds 1989. Refu- 
gees and International Relations Oxford 
Clarendon 

Loescher, G , Scanlan, J. A 1986 Calculated 
Kindness Refugees and America’s Half- 
"adi Door, 1945-Present. New York Free 


Maingot, A P. 1986 Ethnic bargaining and 
the noncitizen. In The Primordial 
Challenge: Ethnicity in the Contemporary 
World, ed J. Stack, pp 81-113. Westport: 
Greenwood 

Maingot, A P 1987 The American debate on 
immigration policy See Levine 1987, 
p 260-80 

Majka, L 1991. Assessing refugee assistance 
organizations in the US and the U.K J 
Refugee Stud 4 267-83 

Marus, M R 1985. The Unwanted: European 


58 HEIN 


Refugees in the Twentieth Century. New 
York. Oxford Univ. Press 

Massey, D, Alarcón, R, Durand, ]., 
González, H 1987 Return to Aztlan: The 
Social Process of International Migration 
from Western Mexico Berkeley. Univ. 
Calif. Press 

Mitchell, C 1989, International migration, 
international relations and foreign policy. 
Int. Migration Rev. 23.681—708 

Montes, S 1988. Migration to the US as an 
index of intensifying social and political 
crises in El Salvador. J. Refugee Stud. 1: 
107-26 

Mozo, S. M., Vasquez, J. J G. 1988 
Salvadoran Migration to the U.S. Washing- 
ton, DC’ Ctr. Immigration Policy Refugee 
Assist. 

Nelson, C., Tienda, M. 1985. The structunng 
of Hispanic ethnicity. Ethnic Racial Stud. 
8494 - 

Nguyen, L T., Henkin, A B 1982. Vietnam- 
ese refugees in the U.S J Ethnic Stud 
9.101-16 

North, D S 1983. Impact of legal, illegal, and 
refugee migrations on U S social service 
programs In U.S. Immigration and Refugee 
Policy, ed. M M Kritz, pp. 269-85. Lex- 
ington: Lexington 

Organization for Economic Cooperation and 
Development. 1992 Trends in International 
Migration. Paris Organ. Econ. Cooperation 
Dev 


Pedraza-Bailey, S. 1985 Political and Eco- 
nomic Migrants in America. Cubans and 
Mexcans Austin: Univ. Texas Press 

Pitterman, S. 1987 Determinants of interna- 
tional refugee policy. See Rogge 1987, pp. 
15-36 

Portes, A. 1984 The mse of ethnicity. Am. 
Sociol. Rev 49:383~-97. 

Portes, A 1987 The social origins of the 
Cuban entlave economy of Miami Soctol 
Perspec 30:340—72 

Portes, A., Bach, R. L. 1985. Latin Journey 
Cuban and Mexican Immigrants in the U.S 
Berkeley: Univ. Calif. Press 

Portes, A., Borocz, J. 1989. Contem 
immigration. Int. Migration Rev 23: 

Portes, A , Mozo, R. 1985. The political 
adaptation process of Cubans and r eth- 
nic minorities in the U.S Int. Migration 
Rev. 19.35—63 

Refugee De 1983. The meaning of 
refugee w dependency rates Migration 
News No. 325-34 

Refugee Reports. 1989, 1989 Statistical Issue. 
Refugee Reports 10(12).1—14 

Refugee Reports. 1991. 799] Statistical Issue. 
Refugee Reports 12(12):1-13 

Rupe 1992. Going home. Refugees 88-6- 

Richmond, A. 1988. Sociological theories of 


rary 
30 


international migration. Current Sociol. 
36(2).7—26 

Rogge, J R, ed 1987. Refugees A Third 
World Dilemma ` Totowa. Rowman 
Littlefield 

Rose, P I 1993 Tempest tost: exile, ethnicity, 
and the politics of rescue. Sociol. Forum 
§.5-24 

Rumbaut, R G. 1989a Portraits, patterns, and 
predictors of the refugee adaptation process. 
See Haines 1989b, pp 138-82 

Rumbaut, R G 1989b The structure of ref- 
uge Int. J. Comp. Public Policy 1.97-129 

Rumbaut, R G , Weeks, J. R 1986. Fertility 
and adaptation. Int. Migration Rev. 20428— 
65 

Salt, J 1989 A comparative overview of 
international trends and types, 1950-80 Int 
Migration Rev 23:431—56 

Skinner, K. A., Hendricks, G L 1979 The 
shaping of ethnic self-identity among In- 
dochinese refugees J. Ethnic Stud 7 25-41 

Stanley, W. D. 1987. Economic migrants or 
refugees from violence? Latin Am Res Rev. 
22 132-54 

Stein, B 1986 Durable solutions to develop- 
ing country refugees. Int Migration Rev. 
20.264—82 

Stepick, A 1987 The Haitian exodus See 
Levine 1987, pp 131-51 

Strand, P. J. 1984. Employment predictors 
among Indochinese refugees Int Migration 
Rev. 18.50—64 

Strand, P. J 1989. The Indochinese refugee 
experience See Haines 1989b, pp. 105-20 

Strand, P J., Jones, W 1985 Indochinese 
Refugees in America Problems of Adapta- 
tion and Assimilation Durham Duke Univ 
Press 

Suhrke, A., Klink F. 1988 Contrasting pat- 
terns of Asian refugee movements. See Faw- 
cett & Carino 1988, pp 85-102 

Taft, J , North, D , Ford, D. 1980 Refugee 
resettlement in the U S. See US Congres- 
sional Res Serv. 1980, Appendix Ill 

Tienda, M , Jensen, L 1988. Poverty and 
minonties In Divided Opportunities Minor- 
utes, Poverty, and Social Policy, ed G D 
Sandefur, M Tienda, pp. 23-62. New York. 
Plenum 

Tilly, C. 1990. T lanted networks 

ans-McLaughlin 1990, pp. 79-95 

Tran, T. V. 1991 Sponsorship and employ- 
ment status among Indochinese refugees in 
the U S. Int. Migration Rev 25.536—73 

Tucker, R W , Keely, C. B., Wrigley, L., 
eds. 1990. Immigration and U.S Foreign 
Policy Boulder: Westview 

US Committee on Refugees 1992 World 
Refugee Survey Washington, DC: US Com. 
Refugees 

US Congressional Research Service. 1980. 
Review of U S. Refugee Resettlement Pro- 


Sec 


REFUGEES, IMMIGRANTS, AND THE STATE 59 


grams and Policies. Washington, DC: US 
Congressional Res. Serv. 
US Department of State 1992 World Refugee 
Report. Washington, DC: US Dep State 
US General Accounting Office 1983 Greater 
Emphasis on Early Employment and Better 
Monitoring Needed in Indochinese Refugee 
Resettlement Program Washington, DC. 
US Gen Accounting Off. 

US Office of Refugee Resettlement. 1991. 
Report to Congress. Washington, DC: US 

. Refugee Resettlement 

Waldinger, R. W., Aldrich, H , Ward, R, & 
Associates. 1990, Ethnic Entrepreneurs: Im- 
migrant Businesses in Industrial Societies 
Newbury Park: Sage 

World Bank. 1990. World Tables Washing- 
ton, DC. World Bank 

Yans-McLaughlin., V ,ed 1990. Immigration 


Reconsidered History, Sociology, and Pol- 
itics New York. Oxford Univ. Press 

Zinser, A. A 1991. Repatriation of Guatema- 
lan refugees in Mexico See Larkin et al 
1991, pp. 57-114 

Zolberg, À. R. 1981 International migrations 
in political perspective. In Global Trends in 
Migration, ed M Knitz et al, pp. 3-27. 
Staten Island Ctr. Migration Stud. 

Zolberg, A. R , Suhrke, A., A , Aguayo, S 
1989 Escape From Violence Conflict and 
the Refugee Crisis in the Developing World. 
New York: Oxford Univ. Press 

Zucker. N , Zucker, N F. 1987. The Guarded 
Gate: The Reality of American Refugee Pol- 
icy. New York. Harcourt Brace Javanovich 

Zucker, N., Zucker, N F 1989. The uneasy 
troika in U.S refugee policy. J. Refugee 
Stud 2.359-72 


Annu Rev Soctol 1993 19.61—87 
Copyright © 1993 by Annual Reviews Inc All rights reserved 


THE SAD DEMISE, 
MYSTERIOUS 
DISAPPEARANCE, AND 
GLORIOUS TRIUMPH OF 
SYMBOLIC INTERACTIONISM 


Gary Alan Fine 
Department of Sociology, University of Georgia, Athens, Georgia 30602 


KEYWORDS. theory, social psychology, sociology of knowledge, history of sociology, quali- 
tative methods 


Abstract 


Symbolic interactionism has changed over the past two decades, both 1n the 
issues that practitioners examine and in its position within the discipline. Once 
considered adherents of a marginal oppositional perspective, confronting the 
dominant positivist, quantitative approach of mainstream sociology, symbolic 
interactionists find now that many of their core concepts have been accepted. 
simultaneously their core as an intellectual community has been weakened 
by the diversity of interests of those who self-identify with the perspective. I 
examine here four processes that led to these changes: fragmentation, 
expansion, incorporation, and adoption. I then describe the role of symbolic 
interactionism in three major debates confronting the discipline: the micro/ 
macro debate, the structure/agency debate, and the social realist/interpretivist 
debate. I discuss six empirical arenas in which interactionists have made major 
research contributions: social coordination theory, the sociology of emotions, 
social constructionism, self and identity theory, macro-interactionism, and 
policy-relevant research. I conclude by speculating about the future role of 
interactionism. 
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INTRODUCTION 


A standard ploy of textbook writers is to divide all sociology into three parts: 
functionalism, conflict theory, and symbolic interactionism. Without assess- 
ing the dubious contemporary justifications of the first two, what should be 
made of the last at our fin de siécle? Where stands symbolic interaction—that 
distinctively American sociological approach, derived largely from interpre- 
tations of the teachings of George Herbert Mead; named by Herbert Blumer, 
over a half century ago;’ inspired by the writings of William James, John 
Dewey, and Charles Horton Cooley; and provided an academic home in the 
first half of the twentieth century at the University of Chicago, by Robert 
Park, W.I. Thomas, and Everett Hughes. Has symbolic interaction become 
excessively fragmented or been incorporated into sociology, or has it 
triumphed in changing the discipline? As my title suggests, all three are 
partially true. ? 

Because of its intellectual tradition, organizational infrastructure, and active 
researchers, symbolic interaction is still too alive for an epitaph. Still, 
symbolic interaction ain't what it used to be. It occupies a very different place 
in the discipline from the time, 20 years ago, when it was labelled “the loyal 
opposition" (Mullins 1973)—a readily recognized social psychological, sub- 
jectivist, "micro," and qualitative stance. Such a view suggested that 
interactionism was “merely” reactive, rather than capable of creating a new 
vision. What is the place of symbolic interaction in contemporary sociology, 
and what role can it be expected to play in the future? 


THE "NEW LOOK" IN INTERACTIONISM 


Fragmentation, expansion, incorporation, and adoption—these processes 
together altered the character of symbolic interaction from a tight social 
network with a clear theoretical and research focus to a program with a slogan 
that increasingly masks a lack of coherence, the core ideas of which have 
been accepted and then taken for granted by the discipline. 


! There have been numerous histories of the development of symbolic 1nteractionism and its 
linkage with the Chicago school of sociology (see, for instance, Fisher & Strauss 1978, Harvey 
1987, Lewis & Smith 1980) 

*Symbolic interaction has been declared dead before, most notably in Nicholas Mullins’ (1973, 
p. 98) infamous—to interactionists—claim that symbolic interactionism’s influence “has come 
to an end." Institutionally, symbolic interaction with its several journals and vibrant organ- 
ization——the Society for the Study of Symbolic Interaction, founded in response to the obituary 
of Mullins and the harsh criticism of Huber (1973)—4s very much alive 
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In the early decades of the development of symbolic interactionism, its central 
themes were lucidly presented and easy to stereotype; these stereotypes, as 
discussed below, had some validity. Herbert Blumer, his colleagues at the 
University of Chicago, and students elsewhere articulated the symbolic 
interactionist perspective and, in effect, policed the boundaries. Interaction- 
ism, like any “new” theoretical orientation, had deep and varied intellectual 
roots (e.g. Stryker 1980, Shalin 1984, Lewis & Smith 1980, Rochberg-Halton 
1987), yet there was general agreement on its immediate parentage. While 
no unanimity existed as to the precise implications of the writings of George 
Herbert Mead (e.g. Miller 1973, Cottrell 1980), the primary source of the 
perspective of symbolic interactionism (and the meaning of Mead, for most 
interactionists) was the writings and teachings of Herbert Blumer. For many, 
Herbert Blumer was symbolic interactionism After the Second World War, 
the cohort of graduate students at the University of Chicago expanded 
enormously. These students, many heavily influenced by Blumer and also by 
Everett Hughes, represented a new generation of scholars, who during their 
careers deepened, expanded, and transformed interactionism, contributing 
important empirical studies and beginning the process of exploring new 
models of cultural and social criticism (Denzin 1992, pp. 10-13). 

Simultaneously a smaller group of interactionists was trained at the 
University of lowa under the leadership of Manford Kuhn. Kuhn emphasized 
the testable assumptions of Mead’s concept of the situated self, often through 
questionnaire measures such as the Twenty Statements Test (e.g. Kuhn & 
McPartland 1954). As a consequence, textbook writers divided interactionism 
into a “Chicago School” and an “Iowa School.” This division was easy but 
misleading, particularly after Kuhn’s death, the waning of interest in the 
Twenty Statements Test, and the changes in sociology at Chicago. The 
Chicago school was subdivided among those who emphasized the empirical 
interests of Everett Hughes and those who built on the theoretical infrastructure 
of Blumer. After Kuhn’s death, the lowa school played a lesser role in 
symbolic interaction, until its reformulation by Carl Couch and his students. 

The reorientation of sociology in the late 1960s into a discipline with a 
greater openness to critical and qualitative perspectives led to competing 
nonpositivist perspectives that accepted as givens—to greater or lesser 
degree—the key tenets of interactionism. Symbolic interaction served as a 
convenient and welcoming home for many sociological “malcontents,” 
frustrated by functionalist orthodoxy. Further, as the interactionists trained in 
the late 1940s and 1950s developed their own dialects and trained students, 
the range of qualitative, interpretive approaches expanded, particularly as few 
students received “pure” interactionist training. 
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With each generation, the core beliefs of interactionism are becoming 
increasingly muddied, even though there remain components of interactionism 
that most who affiliate with this perspective hold to—notably a broad 
acceptance, of Blumer’s (1969, p. 2) classic three premises of symbolic 
interaction: that we know things by their meanings, that meanings are created 
through social interaction, and that meanings change through interaction. 

The dissipation of the institutional centers of interactionist training—Iowa 
and Chicago, and later San Diego—militated against wide agreement on a 
core set of concepts, beyond broad premises. Symbolically the death of 
Herbert Blumer in 1986 closed the chapter in which it could be said that 
interactionism had a vivid identity. Although never providing a systematic 
statement of interactionist belief, Blumer served as an arbitor for what 
symbolic interactionism "really" meant (e.g. Blumer 1980). Even if not all 
accepted his interpretation (e.g. McPhail & Rexroat 1979, Stryker 1981), to 
reject it was to reject "Blumerian" symbolic interactionism. 

Once interactionism may have had a partially deserved reputation as 
parochial and in-bred, but this is no longer deserved. In its post-Blumerian 
age, interactionism might be called intellectually promiscuous. Contemporary 
"interactionists" blend their interest in "classical" interactionism (micro- 
sociological, nonstatistical, robustly relativistic, and proudly anti-positivistic) 
with virtually all sociological traditions. As a result interactionists have 
integrated a “Blumerian” approach with theoretical approaches linked to 
Durkheim, Simmel, Weber, Freud, Habermas, Baudrillard, Wittgenstein, 
Marx, Schutz, phenomenology, post-modern theory, feminism, semiotics, 
and behaviorism. What used to be a fairly narrow, tightly focused perspective 
now might be faulted for deemphasizing the traditional problems of situational 
definition, negotiation, impression management, and meaning creation that 
once animated symbolic interactionism." Revisionists, with some justice, have 
suggested that Blumer's writings (and those of other influential precursors, 
such as Znaniecki, Thomas, Cooley, and Park) reject a narrow and stereo- 
typical microview of interactionism, and claim that such criticisms (Reynolds 
1993) were never wholly accurate (Maines 1988, Tucker 1988). 

In its fragmentation, symbolic interactionism seems bound mainly by a few 
broad tenets, an effective organizational infrastructure, and some active 
publication outlets. Of course, this may be all that many perspectives share. 
The post-modern, post-structural textual readings and cultural studies of 
Norman Denzin (1986) and Patricia Clough (1992) seem light-years from the 
precise experimentation and theory construction of Peter Burke (1980) and 
David Heise (1979). It is symptomatic of the degree of fragmentation that 


? This does not mean that no interactionists are concerned with these “old-fashioned” issues, 
but rather that what had been dominant now is less so. 
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some of the Blumerian “old guard” would question whether any of these are 
“real” interactionists. Similarly the realist, descriptive ethnographies of Ruth 
Horowitz (1983) and Elijah Anderson (1978) are entirely dissimilar from the 
intensely personal and self-reflexive accounts of Carolyn Ellis (1991) and 
John Van Maanen (1988). 

Symbolic interactionism in the 1990s has a diversity that may vitiate its 
center. This splintering, of course, has benefits, in that diversity produces 
intellectual ferment. Yet, such a broadness raises the question of what, if 
anything, post-Blumerian symbolic interactionists share. Does a dominant 
model of symbolic interaction exist? Do the theorists who label themselves 
(or who are labeled) interactionists, belong to the same school? One response 
is that if a sufficient number of individuals label themselves or join an 
organization (the Society for the Study of Symbolic Interaction), then such a 
perspective exists. Yet, this degree of semi-coherence may raise questions 
about its justification as a perspective. 


Expansion 


Linked to the fragmentation of a once relatively unified perspective is a broad 
expansion of “legitimate” research topics. Symbolic interactionism was once 
criticized for a multitude of alleged and real sins. It was claimed to be apolitical 
(hence, supportive of the status quo), unscientific (hence, little more than 
tenured journalism), hostile to the classical questions of macrosociology 
(hence, limited to social psychology), and astructural (hence, fundamentally 
nonsociological). Critics might accept symbolic interactionist dominance over 
the study of face-to-face interaction and microrelations but reject its relevance 
elsewhere. 

Although symbolic interaction may have been limited in content and 
style—a claim partly true, but never as accurate as critics claimed (see Maines 
1988, Wood & Wardell 1983)—the same can hardly be said today. In 
responding to criticism, interactionists have developed concepts that connect 
to the macro and structural demands of sociology (e.g. Kleinman & Fine 
1979, Prendergast & Knottnerus 1993). Below I discuss contemporary 
research and theoretical developments, many of which (social coordination 
theory, macro-interactionism, applied sociology) are distinctly outside of what 
symbolic interactionists had once typically claimed for their domain. 

Recent attempts to link symbolic interaction to chaos theory (Young 1991), 
uses and gratification research (Altheide 1985), social ecology (Frese & 
Roebuck 1980), or to theories of the development of civilization (Couch 1984) 
mark the extent to which interactionists connect their approach to the broad 
span of academic knowledge. The belief that symbolic interaction 1s antago- 
nistic to mainstream social science has been refuted in the past two decades 
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and replaced by the confidence that this perspective contributes a new 
dimension to traditional topics. 


Incorporation 


With expansion of the topic areas of symbolic interaction has come increased 
borrowing from other disciplinary arenas. Symbolic interactionists have 
incorporated other theoretical approaches to invigorate their own perspective. 
Writings that explicitly attempt to blend symbolic interaction and cultural 
studies (Denzin 1992, McCall & Becker 1989) are a model of the incorporative 
turn. Likewise the call for a “synthetic interactionism” (Fine 1992a), blending 
diverse theoretical treatments of agency and structure, calls for interactionists 
to incorporate other models with a Blumerian perspective. 

The willingness to borrow ideas suggests the absence of a fortress mentality. 
Attempts to link interactionism with Marxist and critical theory (e.g. Batiuk 
& Sacks 1981, Ashley 1985), Parsonian theory (Alexander 1987, Sctulli 
1988), or to Vygotsky, Piaget, or Bruner and others in child development 
(Corsaro & Rizzo 1988, Winter & Goldfield 1991) all reveal the desire to 
learn from other intellectually vital sources. While these attempts at outreach 
may fragment coherence, a pragmatic approach should find such linkage 
bracing; after all, one should use the most productive tools. While a pragmatic 
approach denies that anything necessarily goes, it examines outcomes without 
presuppositions. 

Reading the past decade of Symbolic Interaction and Studies in Symbolic 
Interaction uncovers journals with a diversity approaching the discipline’s 
flagship journals. Without conscious planning, symbolic interaction has been 
repeatedly remade. One finds within interactionism, experimental, statistical 
analysis, secondary analysis of survey data, theoretical treatments that are 
grounded in literary criticism, ethnomethodologically inspired conversation 
analysis, European social theory, and policy-relevant applied sociology. In 
contrast, in the early years of the journals the thrust of most articles was 
limited to traditional concerns: the creation of meaning from interaction, the 
social creation of self and identity, and the history and theoretical contributions 
of the founders. While this “normal science” does continue, it has become 
less recognizable as the main task of the perspective. 


Adoption 


Just as interactionists have borrowed from others, so have they been borrowed 
from. During the 1980s the texts of George Herbert Mead were discovered 
by a generation of social theorists (Collins 1989, Joas 1985, Habermas 1987) 
who in general had little knowledge of classic symbolic interaction. The star 
of Goffman continued to rise. He was frequently acknowledged as the most 
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influential American sociologist of the twentieth century, outstripping Par- 
sons, Homans, and Blumer. 

It would be fair to claim, although difficult to demonstrate, that by the 
early 1990s many mainstream sociologists were accepting meaning construc- 
tion, negotiation, impression management, and labelling as components of 
their sociology. A case in point is the inspiration provided by the writings of 
John Dewey for Robert Bellah and his colleagues (1991) in their post-Tocque- 
ville analysis of America, The Good Society. Likewise the use of qualitative 
methods and interactionist concepts by the authors of Alan Wolfe's (1991) 
collection of essays on contemporary America, The Recentering of America, 
reveals the diffusion of the perspective. The New Institutionalism (Dimaggio 
& Powell 1991, Meyer & Rowan 1977) is also grounded on a cultural and 
qualitative understanding of how the conditions of work are linked to the 
analysis of organizational fields and economic structures. Meyer (1992), for 
instance, writes of the influence of Berger & Luckmann's (1966) interpretivist 
classic, The Social Construction of Reality, on his own research. This does 
not mean that structural theorists have accepted interactionism as their 
dominant theoretical model, or that they identify with the perspective, or even 
that they always recognize where their ideas come from, but increasingly 
interactionist constructs are integrated into the body of sociological thought." 
To be sure, some of this involves “the new Columbus syndrome,” renaming 
what has gone before, but in other instances the writers are very aware of 
their debts. Saxton (1989) argues persuasivelv that interactionist writers have 
a social scientific epistemology that solves generic problems of analysis in a 
post-positivist period. Contextualists and constructionists in social psychology 
(Gergen 1982, Shotter 1986, Rosnow & Georgeourdi 1986), and the new 
ethnographers and interpretive theorists in anthropology (Clifford & Marcus 
1986, Geertz 1980), despite their differences, had discovered an epistemo- 
logical tradition similar to that which symbolic interactionists had been 
developing for half a century. Similarly the revolution within communications 
theory owed much to interactionist analysis (Carey 1989). 

The appeal of interactionist concepts further blurred the boundary between 
those who are and are not interactionists. With these ideas spread so widely 
one can claim that interactionism as a bounded sociological perspective is in 
danger, even in its period of greatest triumph. The pages of book catalogs 
and leading journals are filled with scholarship that is compatible, though not 
identical with, interactionism: for instance, theories grounded in the writings 
of Bakhtin, Foucault, and Derrida. With the perspective so internally 


“It must be admitted that some scholars see the field as divided into hostile camps (e g. Lofland 
1990) I examine the same evidence and see a more pastoral condition In true interactionist 
fashion, there 1s no “true” condition, rather a set of analytical preferences 
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fragmented and so many others drawing upon it, the difference is narrowing 
between those who self-identify with interactionism and the many who do 
not, although they accept basic interactionist premises. 


INTERACTIONIST DEBATES 


Taken together, the processes of fragmentation, expansion, incorporation, and 
adoption suggest that contemporary symbolic interactionism is in an awkward 
period of triumph and growth, coupled with acceptance by the mainstream 
which may presage the disappearance of its unique contributions. The tension 
of interactionism’s increasing centrality is evident in a set of contemporary 
scholarly debates—debates that demand that interactionists address issues 
confronting the whole of sociology. While these debates overlap, each has 
been played out in the pages of national journals. Specifically I examine the 
contribution of the interactionist approach to: (1) the debate over the 
macro-micro link in sociology, (ii) the agency/structure debate, and (iii) the 
division between social realists and interpretivists. 


The Micro-Macro Debate 


Along with exchange theory and rational choice models, symbolic interaction- 
ism represents the dominant “micro”-version of sociology. Long before the 
micro-macro debate was named and joined, the connection between levels of 
analysis represented a major interactionist concern. The writings of Anselm 
Strauss and his colleagues (1964; see Strauss 1978, Fine 1984) in the early 
1960s presented the “negotiated order’ paradigm and explicitly placed 
organizational analysis on the interactionist agenda. Strauss believed that 
organization could be understood from the bottom up; that is, macro-structures 
could be understood from a micro-analytic foundation. Strauss did not ignore 
the effects of structure on meanings and interactions, but these were less his 
focus. The extension of research on organizational life led to a recognition 
that institutions had an important role in constraining meaning (Nichols 1991, 
Lynxwiler et al 1983), channeling interaction (Hall 1987), and "embedding" 
the construction of social forms (Gubrium 1992, Holstein 1993), even if these 
macro-structures did not totally determine meaning and interaction. 

In many ways the macro-micro debate was fought on interactionist turf 
(Shalin 1986), even if not all of the participants in the early stages were fully 
aware of its relevance. Erving Goffman's (1983) ASA presidential address, 
"The Interaction Order," provided an interactionist charter for confronting 
the traditional concerns of sociology with the social order. Likewise, the 
concept of "interaction ritual chains" was an attempt to argue that micro- 
interaction preceded structure (Collins 1981) and harkened back to Blumer's 
(1969) emphasis on knitting together lines of action. Others described the 
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sedimentation of meaning (Busch 1982) and macrostructure as collective 
behavior (Blankenship 1976, Bucher 1962). Interactionists have attempted to 
link the macro-micro levels by postulating a middle level: the meso-structure 
(Maines 1982). Here structure is mediated through individual actions, 
coordinated by patterns and expectations (see Levy 1982, Kleinman 1982, 
Pestello & Voydanoff 1991). It has been the particular contribution of the 
interactionist perspective to recognize that the mesoscopic level will permit 
sociologists to examine the social dynamics that permit institutions, organi- 
zations, economic order, and state regimes to compel commitment or 
obedience from individual actors. 

Ultimately interactionists, like others in the debate, concluded that a fixed 
distinction between levels is misleading (Wiley 1988, Law 1984), suggesting 
that institutions of all sizes can be analyzed using similar analytical tools. 
Some argue for a seamless sociology which recognizes that "separate" levels 
are actually intertwined and indivisible, with micro analyses implicated in 
macro ones, and vice versa (Fine 1990b). The debate has been important in 
its attempt to bridge theory groups, bringing micro-sociologists into intellec- 
tual and personal contact with macro-sociologists, breaching subdisciplinary 
isolation. One reason it can plausibly be claimed that symbolic interaction 
has disappeared, although not by name, is the success of the argument that 
all levels of analysis must be considered in an adequate analysis. The rare 
micro-sociologist (whether in exchange theory, ethnomethodology, or sym- 
bolic interaction) disdains interest in questions of larger institutions. In turn, 
most macro-sociologists (structuralist, Marxist, or institutionalist) now accept 
a vision of structures ultimately grounded on the actions of participants, even 
if they do not emphasize the power of the actor as much as interactionists. 


The AgencylStructure Debate 


Few issues are so central to the symbolic interactionist perspective as personal 
agency. The claim that interactionists only believe in agentic choices has been 
a frequent criticism of the perspective. Still, the balance between structure 
and agency is at the heart of the interactionist approach to social order. 
Ultimately social order depends on how agents confront, use, manipulate, and 
remake structure (Dawe 1978), directly and through mediating individuals, 
and how social institutions take individuals into account. The interactionist 
recognizes that much of the world is not of an individual's making (e.g. 
systems of patriarchy or class) and can only be understood in the context of 
the circumstances in which these social realities are expressed. 

As is true with so many newly discovered issues, the linkage of agency 
and structure in interactionism has a long pedigree (Baldwin 1988)—a concern 
implicitly addressed in the writings of Mead, Cooley, Blumer, Goffman and 
others. How do individuals negotiate the realities that are structured—that can 
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be ignored only by those willing to accept severe consequences-—and how do 
structures determine what actors can or will do? Concepts such as obdurate- 
ness, constraint, negotiation, sedimentation, symbolization, identification, 
ritualization—each grounded in traditional interactionist analysis—connect 
the actor with the limits of choice (Fine 1992a). The goal is the Goffmanian 
one of developing an understanding of the “interaction order” that does justice 
to both order and interaction, asking not which definitions are possible, but 
also what definitions are likely and what the consequences are for those who 
ignore the definitions. 

Fundamentally the perspective depends not on individual action but on 
collective meaning creation—on collectivities of any size, although there is 
debate on the extent to which meaning is continuously being generated. For 
instance, an interactionist view of collective and crowd behavior seems to 
drain from individual agents much of their individuality, providing the crowd 
with the ability to transform actors (McPhail 1989). The "structure" (sedi- 
mented individual meanings) is powerful. Attempting to link human action 
with the obdurate reality of the environment—a structure external to the 
actor—Weigert (1991) speaks of a type of behavior he labels "transverse 
interaction." Actors recognize the physical environment as a symbolic other 
and use this understanding to structure their interaction with a "generalized 
other." The relationship between actors and objects is not just meaningful 
but, in a peculiar sense, can be said to be interaction (Cohen 1989). Because 
interaction is set within institutions and responds to obdurate reality, an 
adequate interactionist analysis must take structure into account. Ultimately 
"interactionism is both a theory of experience and a theory of social structure" 
(Denzin 1992, p. 3). 


The Social Realist/Interpretivist Debate 


Interactionists have often been often described, and sometimes describe 
themselves, as fundamentally antiscientific and antipositivist. In a sense this 
is true, but such a claim misses the diversity of the perspective and 
simultaneously ignores the fact that those who questioned standard quantitative 
methods, such as Mead, could see themselves as "scientific." Thus, symbolic 
interactionism has been "haunted by a double-edged specter" (Denzin 1992, 
p.2). While arguing for an interpretive, subjective study of human experience, 
interactionists also hope to create a science of human conduct, a social realist 
approach based on natural scientific criteria. While this debate has been 
exemplified in the writings of specific scholars (e.g. Manford Kuhn vs Herbert 
Blumer), it is also evident within the "privileged texts" of the perspective, 
such as Blumer's (1969) Symbolic Interactionism. How can one be objective 
while still being subjective? All attempts to resolve the dilemma are ultimately 
unpersuasive, and the debate continues. 
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Symbolic interactionism is more methodologically diverse than it is 
frequently given credit for, particularly if one is willing to include inter- 
actionists who study self-concept and identity formation (Rosenberg 1979, 
Burke 1980). Others disdain these data collection techniques, and following 
literary critics, analyze texts as rhetoric (e.g. Gusfield 1976). 

The methodological conflicts between the so-called Chicago and lowa 
schools still reverberate (Falk & Anderson 1983), and some claim that 
symbolic interactionism is divided between humanists and positivists 
(Warshay & Warshay 1987). The issue of the inevitability of causality 
(Lindesmith 1981) continues to divide interactionists. However, these divi- 
sions may be exaggerated as most interactionists accept the systematic 
collection of data, whether through in-depth-interviewing, ethnography, 
introspection, or surveys. 

Social realists believe that one can collect and analyze cata that reflect with 
some fidelity social reality (Farberman 1991, p. 477), while radical subjec- 
tivists and post-modernists see data as a discursive strategy, a second-order 
reality, a text that must continually be questioned and subverted (Clough 1989, 
Schneider 1991, Richardson 1992). 

The chasm between an interpretivist and a social realist approach is central 
to understanding the diversity of contemporary symbolic interactionism. Both 
approaches have become more sophisticated in their theoretical development 
and in their methodology, and they are progressing in substantially different 
directions. Are they still wings of a single perspective in a meaningful sense 
when they cannot agree on epistemology: whether the world is ultimately 
knowable? 


SYMBOLIC INTERACTIONIST DOMAINS 


Ultimately schools are known for what their practitioners accomplish: they 
are activity systems. The growth of qualitative, interpretivist, and interaction- 
ist approaches to sociology depends on the power of lines of research. À brief 
survey cannot hope to do justice to all of the active empirical lines, but I 
examine some of the more significant areas: (1) social coordination theory, 
(ii) emotion work and experience, (iii) social constructionism, (iv) the creation 
of selves, (v) macro-interactionism, and (VI) policy-relevant interactionism. 


Social Coordination Theory 


The explication of universal processes that explain social interaction through 
formal, generic principles is a long-standing goal for social realists who strive 
for systematic principles of knowledge (Prus 1987). As Rock (1979, p. 53; 
see Zerubavel 1980) notes, symbolic interaction 1s deeply indebted to 
Simmelian theory and methods. 
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Perhaps the most compelling and ambitious on-going research program 
within interactionism, specifying the generic principles of collective action, 
is that of Carl Couch and his students. For the past 25 years they have explored 
how social units coordinate their activities. This theory of social coordination 
provides interactionism with a set of universal social principles, which Couch 
(1992, p. 130) likens to both geometry and the chemical periodic table. 
Through a series of edited volumes (Couch & Hintz 1975, Couch et al 1986), 
books (Couch 1989), and articles, largely in interactionist journals and 
annuals, they have elaborated the processes and the conditions of co-presence. 

Couch (1984, p. 8; Miller et al 1975) argues that for a cooperative act to 
unfold, interactants must: (i) establish co-presence, (11) demonstrate reciprocal 
attention, (iii) reveal mutual responsiveness, (iv) create congruent functional 
identities, (v) build a shared focus, and (vi) devise a social objective. In 
establishing a sociable relationship, interactants establish a shared past and a 
projected future (Katovich & Couch 1992, Maines et al 1983). A social 
relationship or group develops traditions and an idioculture (Fine 1979, Wiley 
1991). The existence of similar social pasts (proximal and distal, common 
and shared) permits actors rapidly and un-self-consciously to reconfigure their 
responses. Couch’s efforts in formulating principles of coordinated action 
parallel Mead, Cooley, and Blumer’s concern with adjusting to specific and 
generalized others. Indeed, some experimental research, arising from the 
interactionist tradition, suggests that the coordination is remarkably subtle, 
producing unintended temporal symmetry in microsocial relations (Gregory 
1983). 

Couch’s goal is ultimately nothing less than to create a sociology grounded 
on the micro-level, wherein dyads, triads and other groups create social 
processes, recognizing that invariant processes provide the building blocks of 
a formal sociological theory that will provide understanding of how dyads 
and larger groupings coordinate their actions in organizations and interaction 
sequences. 

Methodologically Couch (1987) and his colleagues have a different stance 
from many symbolic interactionists. In some measure they transcend the 
apparently sharp distinction between qualitative and quantitative research. 
While their data collection may occur in the laboratory (Couch & Weiland 
1986, Katovich 1987) or in field settings (Seckman & Couch 1989, Katovich 
& Diamond 1986), the intent is the same. They collect specimens of social 
transactions produced by dyads, triads, and in a few cases larger collectivities 
in situations that are structurally similar. Each laboratory simulation (e.g. of 
negotiating teams) serves in effect as an ethnographic site, but rather than 
exploring what grounds that interaction situationally in local culture, they 
extract universal principles. The “ethnography” of the social coordinationist 
may seem casual in that it is not based on lengthy immersion in the setting, 
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but rather is designed to observe similar, routine interactions, a form of 
theoretical sampling. It is not surprising that social coordination theorists are 
particularly interested in routine saleswork (e.g. Katovich & Diamond 1986, 
Prus 1989)—these situations recur and provide opportunities for assessing 
generic concepts. Because these researchers search for universals, rather than 
comparative statements, statistical tests are not relevant. The large-scale, 
immersion ethnography favored by the grounded theorist also is not called 
for, in that learning about any particular group and its local situatedness is 
not crucial, nor are subjective accounts. 


Emotion Work and Experience 


In the past 20 years sociologists have discovered emotion as a subject 
(Gordon 1981), a topic largely situated within a broadly defined symbolic 
interactionist perspective. Emotions are understood as embodied, a class of 
experienced reality (Denzin 1984a); as a form of cognitive evaluation, as 
affect control; and as arising from the social world, as part of the dramaturgical 
skills that individuals use to cope with the social order (Hochschild 1983). 
Interactionists treat emotions as "lived experience" (part of a post-modern, 
phenomenological turn), as "cognitive constructions," linked to social mean- 
ings, and as “emotion work" (an interactional strategy of impression manage- 
ment). While these approaches to emotion are distinct, some promising 
attempts have been made to integrate them (Johnson 1992, Scheff 1983, Thoits 
1989). 


EMBODIED EMOTION Emotions are experienced by human bodies, not only 
filtered through social demands, although the social context of the emotions 
ultimately determines what is felt. "Emotion" in Norman Denzin’s (1985, p. 
225) terms is "self-feeling"—affecting a lived body, given meaning by an 
actor in a social world. Denzin argues that emotion is a primary window into 
the self, simultaneously constructed to situate its meaning in community. 
Temporality (Flaherty 1987, 1992, Fine 1990a), physical contact (Denzin 
1984b), and the “natural” environment (Fine 1992b, Mitchell 1983, Weigert 
1991), while not emotion as such, are also states that are directly experienced 
and embodied, and are connected to primary emotions (e.g. boredom, fear, 
exhilaration). Those who study the chronically ill and dying (Charmaz 1991) 
find that the embodied feelings of dis-ease, and not only the social definitions 
given to “the sick,” provide social meaning and identity transformations. 

To understand emotions as a primary reality, some researchers have 
emphasized the value of self-reflection. In an influential and controversial 
article, Carolyn Ellis (1991) calls for “sociological introspection” to help 
comprehend what emotions feel like: in her case the dramatic personal effects 
of her partner’s chronic illness and death. Such uncompromising honest 
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accounts can be uncomfortable reading, but they hew to the demands of social 
phenomenology: to capture lived experience. 


AFFECT CONTROL A second interactionist approach to emotions, affect 
control theory (Heise 1979, Smith-Lovin & Heise 1988, Robinson & 
Smith-Lovin 1992), results from a cross-fertilization between cognitive social 
psychology and "structural" symbolic interactionism (especially identity 
theory—Stryker 1981). Drawing from within and outside classical interaction- 
ism, these researchers emphasize measurable social identity. Through partic- 
ipating in situations, actors adopt social identities that mark their self-defined 
relationships to those with whom they interact, and a failure to establish such 
relations leads to emotional distress (Thoits 1983). The meanings of these 
identities are analyzed on three core dimensions: evaluation (good/bad), 
potency (powerful/powerless), and activity (active/inactive). By measuring an 
actor's definitions through experimental research or questionnaires, research- 
ers examine change (or deflection) of these definitions as a result of 
independent variables. Affect control theory argues that actors construct events 
to confirm their meanings about self and others, minimizing deflections. 
Emotions are signals about the extent to which events confirm or disconfirm 
identity. One may generate emotions from an identity or from the character 
of the situation. Emotional responses are a function of both the situational 
definition and the recognized social identity of the definer. In this model, the 
dynamics are primarily cognitive, and in a real sense emotions emerge from 
definitions, rather than being generated as a direct consequence of external 
stimuli. 

In contrast to the experiential approach to emotions, affect control theory 
demands precise experimental methods—a far cry from what has been taken 
as the traditional interactionist methodological dicta. To the degree that affect 
control theory depends on symbolic interactionist assumptions of the social 
construction of self, it provides another indicator of the blurring of lines 
between interactionism and "mainstream" sociology. 


EMOTION WORK A third approach explores emotions as social constructions 
and strategies for impressing others. These researchers are less interested in 
how emotion is experienced or internally generated than in how emotion is 
performed as a consequence of the demands of social situations and culture; 
this approach draws heavily on dramaturgical analysis. 

From Goffman's (1959) early writing onward, the examination of social 
life as dramatic and designed to persuade has been critical to interactionism. 
This approach, generally known as the "strategic paradigm" (Lofland & 
Lofland 1984), has focused on how social actors manage their performances 
in their verbal, paraverbal, and nonverbal aspects. Even in Goffman's early 
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writing, he emphasizes that emotions are strategic, and that social actors are 
socialized in their use. Emotions are linked to identity work (Snow & 
Anderson 1987, Clark 1987). Just as individuals select emotions to display, 
so can other actors channel the propriety of certain emotions, such as grief 
(Rosenblatt 1988, Lofland 1985) Organizational demands and occupational 
roles shape how people express emotion and sometimes may even affect how 
they feel (Hochschild 1983, Zurcher 1985, Gubrium 1992). In this view, 
emotions are learned behavior and controllable, and “feeling rules” exist that 
determine when and which emotions will be performed. 


SOCIAL CONSTRUCTIONISM 


Symbolic interaction has long been embedded 1n the examination of social 
problems, certainly from Mills's (1942) analysis of the social rhetoric and 
accounts surrounding the definition of social problems. Labelling theory 
(Becker 1963) developed from interactionist constructs, recognizing that the 
public 1s at least as much involved in the creation of deviance as is the 
purported deviant. Over the past 30 years, labeling theory has been criticized, 
expanded, and altered, and is a subtype of theories that have come to be 
known as “social constructionism” (Schneider 1985). Labeling theory is a 
“micro” variant on the Durkheimian emphasis on the need of a society to 
establish boundaries (Erikson 1963), focusing attention on the reaction of 
social actors, rather than on societal understandings. Ultimately the inter- 
actionist approach to “social problems,” and indeed to all spheres of 
knowledge, is to examine the ways in which boundaries are established and 
defended (Zerubavel 1991, Gieryn 1983). 

The social constructionist approach provides a means by which inter- 
actionists address the institutional formulation of social problems. Why are 
some patterns of action defined as “problematic,” while others are “normal- 
ized”? In the classic formulation (Spector & Kitsuse 1977): How are social 
problems constructed? The “social constructionist” approach permits inter- 
actionists to examine dynamic, historical processes affecting the social system, 
such as “the medicalization of deviance” (Conrad & Schneider 1980). 
Constructionism has come to dominate social problems theory, but its 
elaboration has developed its own splits and theoretical controversies (Holstein 
& Miller 1993). For instance, there is active debate between those who 
emphasize that all meaning (and, hence, the existence of “objective” condi- 
tions) should be problematized (Woolgar & Pawluch 1985), suggesting that 
sociological knowledge is as constructed as the rhetonc of social problems 
actors, and those who accept the existence of objective conditions, while 
wishing to focus on the processes by which some of these conditions become 
a part of public debate (Best 1989), suggesting that sociologists can, to some 
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degree be “honest brokers.” Cultural conditions (Fine & Christophorides 
1991), institutional realities (Hilgartner & Bosk 1988), and the role of claim 
makers (Pfohl 1977) all contribute to determining what enters public debate. 
The constructionist view has become so dominant that it is rare to read a 
sociological treatment of a social problem that does not allude to the way in 
which the recognition of a social problem is a function of extra-objective 
criteria. 

Constructionism is not limited to the examination of social problems but 
applies to the creation of all social life, tied to W.I. Thomas’s classic claim 
that situations must be defined and that these choices have real consequences. 
The social construction of everything is not merely a joking claim; it is 
fundamental and increasingly accepted as part of the sociological worldview. 
For instance, the interactionist view of psychosis (Rosenberg 1984) is based 
on the patient’s inability to take the role of the other and make “proper” 
(consensual) attributions—in other words, to be unable to construct the social 
world in a manner that coordinates with the perspectives of others. We use 
whatever is handy to construct our own selves—including our physical 
locations or clothing—and to construct identities and character of others as 
well (Stone 1962, Weigert 1986, Hood 1984, Davis 1992). Any piece of 
symbolic driftwood is added to these constructions. The active model of social 
interpretation is a well-recognized feature of much contemporary sociological 
writing. 

Erving Goffman’s (1974) treatment of frame analysis, drawing on Gregory 
Bateson, Kenneth Burke, and W.I. Thomas, examines how actors know what 
class of activity is going on (e.g. teasing, experimentation, fraud, or fantasy 
play). Frame analysis probably has the most dramatic impact in social 
movements research. Scholars in this tradition (Gamson et al 1982, Snow et 
al 1986), while not using “frame” precisely as Goffman did, argue that the 
rhetorical techniques by which movement entrepreneurs define a claim has 
real effects on public responses and organizational growth. 

Interactionists argue that even the past is constructed—time and history are 
not immutable, but their meaning results from situational appropriateness and 
the activities of moral entrepreneurs. Thus, interactionists turn backward 
trying to understand historical events and, as significantly, how historical 
events are given weight. That temporal categories have not always had the 
same meaning was persuasively argued by Eviatar Zerubavel (1981). Holi- 
days, weeks, and years are not given by a temporally fixed universe but are 
constructed socially, with attendant symbolic meaning. So, too, we construct 
the memoralization of events and persons (Schwartz 1987, Wagner-Pacifici 
& Schwartz 1991), which can be the source of drama (Gross 1986), communal 
identification (Billig 1991), contention (Gregory & Lewis 1988), and self 
(Davis 1979). Everything is potentially up for grabs; that everything isn’t 


SYMBOLIC INTERACTIONISM 77 


actually heeds the obdurate character of structure, power, and sedimented 
meanings. 


THE CREATION OF SELVES 


Sociological social psychology, marginalized in the 1970s, has reemerged to 
contribute to the broadening of the discipline. Nowhere is this more evident 
than in the rejuvenation of the sociological study of the self, identity, and 
social role. The development of the social and symbolic self, a root issue of 
symbolic interaction from James, Cooley, and Mead, is central to interaction- 
ist research and theory and includes such issues as self-esteem, self-feeling, 
self-concept, identity work, and self-presentation. 

Symbolic interactionism, as practiced by those sociologists trained by 
Everett Hughes at the University of Chicago in the late 1940s and early 1950s, 
tended to deemphasize self in favor of situation; the sociology of Erving 
Goffman, implying that there was no deeply held “real” self, only a set of 
masks, was a prototype for this view. Yet, despite the attention to situation, 
interactionists such as Ralph Turner (1976, 1978) emphasized that the creation 
of the self results from social and cultural trends. Hewitt (1989), for instance, 
argues that a basic conflict appears in American selves between individualism 
(independence) and community participation (interdependence). 

While interactionists hold that no “real, true, core” self can be found, 
analyses of how selves develop are part of interactionist analysis—both by 
interpretivist theorists associated with a post-modern literary analysis, and by 
social realists who are more closely connected to experimentation and 
hypothesis testing. Interactionism pictures the self as symbolic, situationally 
contingent, and structured. 

Describing the interpretivist view as “post-modern” does not do full justice 
to this approach, which is grounded in equal measure in feminist theory. The 
shared interests of interactionists and feminist researchers emphasize the 
“gendered” quality of self—that is, self is not biologically given, but is created 
from social demands and responses to those demands (Wiley 1991). The 
recognition of gender has affected research in all domains of sociology, but 
nowhere more significantly than in self-research. Given the frequent, though 
not universal, argument that gender is socially constructed (Kaufman 1991, 
Krieger 1983), feminist sociology is a natural ally for interactionism (see 
Deegan & Hill 1987). 

Self is generated through rhetoric and story-telling by oneself (Denzin 1987, 
Miller 1991) or others (Adler & Adler 1991: ch. 6) and the manipulation of 
other symbols (Schwalbe 1983). The literary creation of self has become 
prominent in interactionist writing (Richardson 1992, Rambo Ronai 1992). 
The self is the text. Some playfully link Mead's "T" to "irony" (Tam 1984). 
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The literary, verbal, symbolic construction of self is one pole of the 
interactionist approach to self. 

A second pole is identity theory, which agrees that self is constructed, but 
rather than seeing this construction as creation, here the self is constructed 
through adjustment. The issue is for the actor to fit his/her self into the 
dominant character of the situation or structure: adjusting to an obdurate reality 
(Brown 1991). This parallels the emphases in affect control theory. As in 
affect control theory, identity theory can be tested through experimental and 
questionnaire techniques. Theorists such as Stryker (1980) and Rosenberg 
(1979) attempt to specify the predictable process by which role-making and 
changes in self-image occur. Others such as Ralph Turner, Viktor Gecas, and 
Louis Zurcher examine the fluidity of role-constructs, while admitting that 
these “mutable selves” (Zurcher 1977) have spatial, institutional, and temporal 
stability. 

All interactionists, however distinct their theoretical grounding, method- 
ological choices, and assumptions about the proper level of stability and 
reification, agree that self is not an object that has inherent meaning, but is 
a construct that 1s given meaning through an actor’s choices, mediated by the 
relationships, situations, and cultures in which she or he is embedded. 


MACROINTERACTIONISM 


The easy charge had always been that symbolic interaction was a micro 
sociological perspective, with no interest in structure, no belief in the power 
of organizations and institutions, and no constructs to examine such issues 
(Maines 1988, Strauss 1991, Hall 1987). As noted earlier when considering 
the macro-micro debate, such a charge had always been misleading, as Blumer 
(1969), for instance, regularly wrote about "acting units," rather than actors. 
Yet, in recent years, interactionists have more self-consciously addressed 
macro-sociological issues, using the intermediate level of mesostructure. 

This emphasis received prominence in the influential survey article by 
David Maines (1977) in the Annual Review of Sociology, "Social Organization 
and Social Structure in Symbolic Interactionist Thought," emphasizing the 
interactionist tradition of concern with structure, institutions, and organiza- 
tions (see Overington & Mangham 1982). The concepts of negotiated order 
(Kahne & Schwartz 1978), constraint (Denzin 1977, Farberman 1975), 
network (Fine & Kleinman 1983, Faulkner 1983), collective activity (Becker 
1982, Gilmore 1988), and symbolic meaning (Schmitt 1991, Manning 1992) 
provide an entrée to macroanalysis. 

A compatible perspective has been developed by organizational theorists 
who recognize the importance of the lived experience of being in organizations 
(Dimaggio & Powell 1991, Hodson 1991) and the effects of webs of meaning 
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and culture in organizational life (Pfeffer 1981, Ouchi & Wilkins 1985). Some 
Organizational theorists suggest that organizations are characterized by 
‘loosely coupled” systems (Weick 1976), are fundamentally anarchic (Cohen 
et al 1972), and have recognizable cultures (Zucker 1977, Kamens 1977). 
That actors are “corporate” in that they represent positions or agencies does 
not mean that the interactionist perspective on social action is irrelevant. That 
these are “symbolic persons” makes the dramaturgical and interpretive 
perspective more powerful, if it admits that these actors are motivated by 
corporate impression management and limited by organizational structures. 

While interactionists have much to accomplish in opening their approach 
to the economic and political analysis of social systems (but see Burawoy 
1979, Smith 1991), there is a compelling argument that organizational fields 
(Strauss 1982) are structured through symbolic negotiation, and, as a 
consequence, there is little difference from small-scale negotiations. Ulti- 
mately a localized economic organization—a market of sellers and buyers— 
emerges from the structural conditions in which it is embedded. While 
seemingly far distant from the examination of interaction systems, all 
large-scale systems are ultimately grounded in the symbolic constructs that 
individuals use in coping with their local reality. 


POLICY-RELEVANT INTERACTIONISM 


According to some critics, interactionists are little concerned about amelio- 
rating the world around them, and are fundamentally apolitical and apathetic 
(Gouldner 1970, Huber 1973). Such a charge ıs strange aimed at an approach 
that descends from pragmatic philosophy, perhaps the American philosophical 
tradition with the greatest commitment to improving the world. Both Mead 
(Shalin 1987) and Blumer (Wellman 1988) had strong political leanings, 
evident in their writing. Mead was a progressive activist, involved in Chicago 
progressive politics. Blumer was, for portions of his career, a labor mediator 
and held strong opinions about racial discrimination. Indeed, Blumer’s first 
major empirical study, funded by the Payne Study and Experiment Fund 
(Blumer 1933) had a specific policy focus—to examine the effects of film on 
youth (Denzin 1992, Clough 1988). 

Two arguments have been advanced to explain why interactionists are not 
actively involved in policy debate and political action: one methodological 
and one theoretical. Methodologically it is asserted that since interactionists 
often avoid statistical techniques, their conclusions are viewed skeptically by 
policy opponents, who claim that the bias of the individual researcher infects 
the data. No doubt when policy makers were themselves convinced of the 
desirability of having precise, "objective," and confirmable data this was so, 
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but there is an increased willingness to examine evaluations tied to the 
understanding of actors (Patton 1980). 

Theoretically interactionists believe that truth is a social construct, but that 
position cannot be equated with claiming that any course of action is as good 
as another. Yet, if truth is grounded in one’s perspective, this seems to suggest 
that state action, constraining individual choice, is unwarranted, because no 
person could make a responsible choice for another. As a result interactionism 
is seen as profoundly anarchic (Lofland 1988) or libertarian (Fine 1993). Yet, 
if one specifies collective goals, admittedly from a political or cultural stance, 
researchers can suggest how those goals or means can be achieved. Further, 
a radical relativist position has never been central to interactionism, as 
obdurate realities and collective meanings have long been recognized. While 
it might be argued on an esoteric theoretical level that racial discrimination 
or battering children could be defined as right, within the society in which 
we reside, within our universe of discourse, such choices are abhorrent. 

Interactionists have addressed both the general issue of policy research 
(Estes & Edmonds 1981), and specific applied and policy domains (Kreps 
1989, Glassner & Freedman 1979, Corbin & Strauss 1988). Indeed, symbolic 
interaction is now increasingly influential in such “professional” fields as 
social work, nursing, education, and theater arts. In practice, interactionist 
research demonstrates its value for those who wish to make the world a better 
and more secure place in which to live. 


WHENCE SYMBOLIC INTERACTION? 


One is challenged to take the measure of a broad and vibrant perspective: 
contemporary symbolic interaction is both. In my title, I raise a triple paradox: 
how can interactionism simultaneously be in a state of demise, disappearance, 
and triumph. Each claim refers to the phenomena described above: fragmen- 
tation, expansion, incorporation, and adoption of symbolic interaction. Those 
characteristics that once propelled symbolic interactionism into its distinctive 
oppositional stance now have less significance, raising the question of whether 
the position of symbolic interactionism has been redefined. Will symbolic 
interactionism confront the discipline from outside, or have substantial 
changes already occurred, altering the mission of those who draw on Blumer’s 
Three Premises? Like many “special” populations, have we been 
mainstreamed? 


DEMISE In one sense, no demise in symbolic interaction is evident: the theory 
has not been discarded as old, irrelevant, wrong, or useless. Organizationally 
growth has occurred. Perhaps some would argue there has been a decline in 
the number of graduate students or centers for training, but one might respond 
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that graduate training is less narrow. Rather, the demise—if we label it 
that—results from fragmentation and the feeling that few core beliefs are 
universally accepted. The death of Herbert Blumer cost the perspective its 
charismatic leader. The journal Symbolic Interaction publishes articles by 
those who do not see themselves as interactionists and are not so labelled by 
others. Others who are intellectually compatible with interactionism do not 
use that term to describe themselves, not because they see it as stigmatizing, 
but because it is irrelevant. Central assumptions have faded as the generators 
of research and have not been replaced: the perspective is multifocal. The 
center did not hold. 


DISAPPEARANCE As long as a journal, an organization, and people who 
choose the label exist, symbolic interactionism will not disappear. Yet, the 
lines separating this perspective and the discipline as a whole have become 
blurred and uncertain. In other words, the concepts of interactionism have 
been given over to the mainstream. It is not just the lack of a center, but the 
existence of a periphery that does not belong to the perspective alone. Will 
we need a group of sociologists who choose to label themselves by this ancient 
moniker, while others share their work? 


TRIUMPH As noted, the concepts of interactionism have become the concepts 
of much sociology. This is surely not an insignificant achievement for a 
perspective that had recently been written off as intellectually exhausted. The 
leading journals of the discipline now regularly publish qualitative, interpre- 
tive research, written from numerous angles. Social constructionism, the 
sociology of emotions, identity theory, the post-modern turn, organizational 
culture, negotiated order, frame analysis, rhetorical analysis, the reconstruc- 
tion of the past, the sociology of temporality, and the analysis of gender, 
class, and race as situated achievements all rest on interactional frameworks, 
even if contributions to these arenas are broader than from interactionism 
alone. 

If the goal of symbolic interaction 1s to maintain itself as a distinctive 
oppositional movement, then it has failed, with more and more outsiders 
addressing central issues and more and more insiders stepping outside the 
boundaries, not caring about their badges of courage. Yet, if the ultimate goal 
is to develop the pragmatic approach to social life—a view of the power of 
symbol creation and interaction—then symbolic interaction has triumphed 
gloriously. 

Predicting the future is dangerous, but it is evident that the label symbolic 
interaction will abide—it hosts a cheery and sociable club. Its journals will 
remain strong. Yet, we will find more intermarriage, more interchange, and 
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more interaction. Symbolic interaction will serve as a label of convenience 
for the future, but will it serve as a label of thought? 
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Abstract 


The focus of this review is macro-level organizational change in medical care 
organizations in the United States, with particular emphasis on the decade of 
the 1980s. We begin with a brief review of the historical context of this sector 
and discuss several important trends characterizing the industry since 1980. 
The body of this review focuses on major perspectives in organizational theory 
which either have been used or could be used to studv these changes. We 
consider both determinants and consequences of change in the medical care 
sector. 


INTRODUCTION 


This review may be attempting the impossible: to give structure and boundaries 
to an area of research that has exploded over the past decade—the study of 
medical care organizations and how they have changed. In the 1980s it became 
a commonplace to speak of change in the environment of health care 
organizations; particularly change in regulations and reimbursement policies, 


89 
0360-0572/93/0815-0089$2.00 


90 | FENNELL & ALEXANDER 


health care costs, patient expectations, provider characteristics, technological 
innovation, population demographics, and major diseases. Today, medical 
care organizations themselves have changed so drastically that it is no longer 
clear exactly what 1s foreground and what is background, what is cause and 
what is consequence. 

Át the same time, the field of organizational sociology has experienced 
something of a renaissance, a flowering of diverse theoretical perspectives, 
most of which give central emphasis to the environment as a force shaping 
organizational structure, change, and performance. Although the application 
of organizational theory to healthcare organizations has a long and respected 
history (Perrow 1961, Heydebrand 1973, Georgopoulos 1972), some have 
argued eloquently that we no longer merely apply organizational theory to 
health organizations; the theories are themselves informed, refined, reformu- 
lated, and developed as a result of our observations in this rapidly changing 
sector (Mick 1990). 

Further, the "marriage" of organizational theory and health services research 
has been reflected by a noticeable increase in the number of such articles 
published, as well as by the number of journals in which such work now 
regularly appears. This is a field of research where multiple disciplines are 
active, fundable, and prolific. A secondary objective of this review, then, is 
to introduce and summarize for the sociologist-reader the major work that has 
appeared on this topic in nonsociological outlets. 

To give some structure to this burgeoning field we are necessarily selective 
in what we cover. Substantively, we focus primarily on US general hospitals 
and "alternative provider" organizations (e.g. HMOs, PPOs, and group 
practices). Temporally we emphasize the decade of the 1980s and the early 
1990s, 1n both sector changes and literature to be covered. The body of our 
review focuses on several major trends observed in the transformation of 
medical care organizations, and we examine how contemporary perspectives 
such as strategic choice, population ecology, and institutional theory are being 
used to study the determinants of these major changes. Finally, because the 
consequences of those changes are still unfolding, we speculate on how one 
might analyze those consequences at both the organizational and system 
levels. 


EMERGING TRENDS OF THE LATE 1980S AND EARLY 
1990S 


The Historical Context 


Over the last half of the twentieth century we have witnessed the context of 
health care evolve and continue to evolve from individual, patient-physician 
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relationships to care provided by organizations. For many years, organiza- 
tional scientists focused on hospitals as closed systems. These organizations 
were interesting because they departed in their structure and bebavior from 
the standard Weberian model of a bureaucratic organization. Studies typically 
focused on the unusual organizational structure of hospitals (e.g. dual lines 
of authority), on the role of professionals (e.g. physicians) in organizational 
settings (Freidson 1975, Strauss 1985), or on identifying appropriate structures 
to achieve positive organizational performance or higher quality care 
(Georgopoulos 1972, Heydebrand 1973). 

It has been recognized more recently, however, that the structural forms 
assumed by hospitals and other health delivery organizations in this country 
have been directly influenced by external forces such as the profession of 
medicine, and indirectly by medicine's influence over federal and state 
regulatory statutes pertaining to health care delivery and reimbursement. These 
regulatory statutes have prescribed, for example, the number and type of 
personnel who can be employed, the extent to which organizations can engage 
in overt competitive activity, and the rather unusual organizational structure 
that has separated the organization's administrative management from the 
medical staff. 

The state's influence on health delivery organizations can be traced in the 
post-war era through expansionary programs such as Hill-Burton in 1946 and 
the introduction of Medicare and Medicaid in the mid-1960s. The Hill-Burton 
program provided direct stimulation to hospital construction, and Medi- 
care/Medicaid further stimulated hospital expansion by providing payment of 
depreciation on hospital assets. More recently, the influence of the state has 
shifted away from increasing access to hospital care, toward containing 
hospital costs and encouraging competition among health care providers. Of 
critical importance has been the radical shift in the hospital payment structure 
for Medicare, from cost-based to prospective payment (PPS), adopted in 1983. 
These shifts ran parallel to major organizational restructuring in the heaith 
care sector. 

There are at least three major trends characterizing health care organiza- 
tional change during the past decade: (1) an increase in the diversification of 
organizational types and products; (ii) change in traditional ownership and 
management configurations; and (ii) the development of new interorganiza- 
tional arrangements and multitiered governance structures. This section briefly 
reviews the recent literature on each trend. 


Increased Diversification 


The field of health care organizations became increasingly more diverse during 
this period, in at least two ways. New forms of providers have emerged, such 
as free-standing emergency centers (FECs) and ambulatory surgery centers 
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(ASCs), which compete with hospitals in the provision of episodic emergency 
care and outpatient surgery. The numbers of these new types of providers 
mushroomed during the 1980s (Updaw 1987). Within hospitals, however, 
diversification has itself become a frequently invoked and discussed strategy. 

Shortell and colleagues (1987) defined hospital diversification as the 
provision of “new services for new markets” (p. 5), which do not involve 
acute inpatient care. Examples include home health programs (Balinsky & 
Starkman 1987), health promotion programs, geriatric programs, and outpa- 
tient surgery units. Diversification allows the hospital to reduce its vulnera- 
bility to uncertainty within the inpatient, acute care sector, by broadening its 
base of activity. Another form of diversification is the ownership of a variety 
of different levels of care (primary, secondary, tertiary, etc) geographically 
spread throughout different regions (Brown & McCool 1986). 

Diversification may lead to the related strategy of vertical integration, in 
which one health care organization (such as a hospital or a hospital system) 
provides all levels and intensities of service to all patients (Clement 1988). 
Vertically integrated systems are described in the hospital literature as a 
strategy with many potential benefits, including broadening the hospital’s 
referral base, attracting more ambulatory business to offset declining inpatient 
revenues, and forestalling competitive physician activity. Many have argued 
that vertical integration is the preferred strategy, because it supports the 
hospital’s core business, whereas diversification moves resources away from 
the core (Fox 1989). 


Ownership and Management Reconfiguration 


Changes in hospital ownership and management patterns over the past decade 
have drastically changed how we think about the traditional freestanding, 
not-for-profit, community hospital. In short, that breed of hospital has become 
increasingly rare. There are basically three major facets of ownership/man- 
agement reconfiguration: (i) an increased concentration of the hospital industry 
through closure and merger; (ii) a rise in the prevalence of hospital corporate 
restructuring; and (iii) a blurring of the distinction between profit and 
not-for-profit ownership, and a change in management “climate” from 
emphasizing nonprofit service to competitive business strategies. 

The number of community hospitals in the United States has declined to 
approximately 5800 from a high of 6500 in 1975. This contraction has 
occurred through a variety of mechanisms, including mergers and closures. 
When mergers occur, acquiring hospitals are more likely to be large 
community general hospitals, whereas acquired hospitals are more likely to 
be specialty hospitals (Mullner & Andersen 1987). The number of hospital 
mergers has never been very high, and that number has reportedly declined 
in the early 1990s, probably as a result of a decline in potential merger targets 
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and also in response to increased attention to the hospital industry for 
enforcement of federal antitrust laws by the Bush administration (Baehr & 
Furlong 1991). 

The impact of hospital closure has had a more dramatic effect on contraction 
of the hospital industry during this period. A total of 323 hospitals closed 
during the period 1975 through 1979. The next five-year period witnessed the 
closure of another 279 hospitals, and from 1986 through 1988 the number of 
closures has increased to 342. Although there is an extensive literature on the 
determinants of hospital closure (Longo & Chase 1984, Topping 1991), 
relatively few studies have analyzed closure from a coherent theoretical 
perspective, and most report mixed findings. 

Corporate restructuring is defined as a type of reorganizational strategy that 
segments a hospital’s assets and functions into separate corporations. This 
type of change has been observed in other industries for decades but has only 
recently emerged in the hospital industry. Few studies are available on this 
strategy of ownership reconfiguration, and most of the early literature focused 
on proscriptive discussion of the expected gains of hospital restructuring 
(Memel 1986). Empirical analyses of either the consequences or the causes 
of corporate restructuring are relatively rare. An exception is the work of 
Alexander and colleagues (1988), who found that restructured hospitals tend 
to develop corporate-type hospitals boards with business-like orientations. 

In many ways, the trends toward industry consolidation and corporate 
restructuring have both contributed to the other trend in ownership/manage- 
ment reconfiguration. The distinction between for-profit and not-for-profit 
ownership used to be a more telling difference than is now the case. Within 
the HMO industry, a number of nonprofit HMOs have converted to for-profit 
status, particularly since 1983 when the federal government withdrew the loan 
program available to nonprofit HMOs under the 1973 Federal Health 
Maintenance Organization Act (Ginsberg & Buchholtz 1990). We have also 
observed a change during this period in the management “climate” of most 
health care organizations. This change is apparent in the increased emphasis 
on competitiveness and the shift away from non-profit service. Traditional 
wisdom held that investor-owned hospitals are more market driven than 
not-for-profit hospitals, more efficient, and less likely to emphasize patient- 
community welfare values. Recent evidence on these assumptions, however, 
is equivocal, and sometimes suggests the opposite (Sear 1991). Not-for-profit 
hospitals, public hospitals, and investor-owned hospitals are more likely to 
be similar than different in their adoption of the “business ethic” and strategic 
planning (Greaf 1988). Total quality management programs have inundated 
the hospital management/consulting industry and are found in all types of 
hospitals, as are marketing divisions, an emphasis on health care “products” 
and product-lines. Mick (1990) observed that the vocabulary of medical care 
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organizations has changed from “systems” to “industry,” and the symbolic 
value of that change is profound. 


New Interorganizational Patterns and Multitiered Structures 


The third major type of change we have witnessed in the medical care sector 
has been the proliferation of organizational structures that combine multiple 
organizations and that require more complex governance and decision-making 
hierarchies. A wide variety of such multi-institutional and multitiered struc- 
tures have emerged. To simplify our synthesis of studies in this area we use 
three categories: (i) multihospital systems (including contract management 
arrangements); (ii) loosely structured provider networks or coalitions; and (iii) 
formally structured ownership arrangements linking hospitals, physician 
groups, and third-party payors. 

Multihospital systems (MHS) are defined as the combination of two or 
more hospitals that are owned, leased, sponsored, or managed by a single 
corporate entity (AHA 1983). The number of hospitals owned by systems 
grew dramatically between 1970 and 1980 (AHA 1982), slowed somewhat 
in the 1982—1984 period (enactment of PPS), and then grew again from 1985 
through 1988. The actual number of MHS nationwide, however, has begun 
to fall, particularly among for-profit systems. 

Earlier work on multihospital systems (MHS) tended to be descriptive, 
nonempirical, and nontheoretical in nature. Very little attempt was made to 
distinguish among types of systems or to examine how systems vary in 
structure, process, or performance. As reviewed by Morrisey & Alexander 
(19872), the early literature on MHS usually assumed that hospitals within 
MHS enjoyed economic and managerial advantages over freestanding hospi- 
tals, that selection into systems is a random process, that the benefits of system 
participation are general across all types of systems and accrue at entry, and 
that hospitals participate in systems for the same reasons and under basically 
similar conditions. More recent analyses, however, have begun to discount 
such assumptions and provide more finely tuned pictures of the differences 
across systems on a number of dimensions. For example, Luke & Begun 
(1988) argue that the strategic behavior of small MHS will reflect the unique 
characteristics of their local markets more so than larger regional or national 
systems. Their analysis of 78 small MHS identified specific growth and action 
orientations that were associated with ownership status. Controlling for system 
size, Fennell & Alexander (1987) also found differences among nonprofit, 
for-profit, and religious system hospitals in the types of boundary-spanning 
strategies used. Governance within MHS differs, depending upon system size, 
ownership, and spatial distribution of members (Morlock & Alexander 1986), 
as does the location of decision-making authority—by the system headquarters 
or by member hospitals (Alexander & Fennell 1986). 
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Contract management has often been considered a special type of 
multihospital arrangement. A "full service contract arrangement" is defined 
as the engagement of an outside firm by the hospital's board to manage 
the hospital's affairs for a negotiated fee (Kimberly & Rosenzweig 1988). 
The number of such arrangements grew rapidly in the 1980s and has made 
substantial inroads in the rural hospital market. Management firms typically 
manage a number of hospitals or health care organizations, so for a hospital 
to enter into a management contract is in effect to become part of a 
multihospita| system. However, work by Morrisey & Alexander (1987b) 
convincingly argues that system acquisition and entering into a management 
contract are distinctly different strategies, selected under different conditions 
for different reasons. Their analyses suggest that system-owned and con- 
tract-managed hospitals should be considered separately, rather than pooled 
in any analysis of performance differences. Contract management appears 
to be favored by hospitals when the hospital mission and the system mission 
are in conflict (e.g. religious hospitals and for-profit systems); acquisition 
or merger is likely to occur only when system and target hospital objectives 
are compatible. 

Another set of multr-institutional arrangements to be introduced during the 
past decade emphasizes cooperation and coordination among hospitals and 
providers without the radical fiscal and legal reorganization of mergers, 
corporate restructures, multihospital system acquisition, or contract manage- 
ment. Multi-institutional "networks" allow for coordinated strategic action 
across providers toward specific objectives, without sacrificing organizational 
autonomy, either legally or functionally. Included among these arrangements 
are voluntary systems, affiliations (Budd & Ross 1988), consortia (Christian- 
son et al 1990), confederations, alliances (Zuckerman & D'Aunno 1990), 
quasi-firms (Pointer et al 1988), and federally sponsored networks (Fennell 
& Warnecke 1988, Kaluzny et al 1990). The diversity of these arrangements 
is infinite, in terms of the types of organizations to become involved (from 
hospitals to nursing homes to physician groups), the types of linkages 
developed (both horizontal and vertical), and the foci of the network 
(educational programs, innovation diffusion, joint purchasing, and discharge 
planning). Many writers have suggested that these more loosely structured 
networks provide a viable alternative to vertical integration, without the costs, 
risks, and burdens of ownership of multiple businesses (Zuckerman & Kaluzny 
1991). 

Finally, the third type of multi-institutional arrangement to be recently 
observed is a formal arrangement of joint ownership or contractual agreement 
across different types of providers and third-party payors. The distinctive 
features of these arrangements are that they constitute formalized inter- 
organizational structures, that they join different types of organizational 
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actors, and that the linkages among them may develop hybrid, multitiered 
governance and decision-making hierarchies. 

Joint ventures between hospitals and physicians provide the most common 
example in the health industry, although hospital group joint ventures and 
preferred provider organizations (PPOs) linking physicians, hospitals, and 
third party payors are also included. A joint venture is usually defined as a 
voluntary association that results in a contract to pursue a specific enterprise 
or transaction (Troyer 1986). Rosenfield (1985) provides a useful discussion 
of various joint venture models, from contractual to corporate, general and 
limited partnerships. 

Again, descriptive and proscriptive work abounds on hospital-physician 
joint ventures, particularly as a strategic response to prospective payment 
(Greifinger & Bluestone 1986). A more recent trend in the literature, however, 
has been to conceptualize the physician-hospital joint venture as one of many 
possible strategies for managing the relationships between physicians and 
hospitals (Smith et al 1990, Glandon & Morrisey 1986). Work by Shortell 
and his colleagues has been most instrumental in defining the emerging field 
of hospital-physician relationships and physician-hospital organizations 
(PHOs; see Shortell 1991). The underlying premise of this genre is that the 
physician-hospital relationship will become ever more critical as competitive 
pressures in the health industry build, and as the impetus toward systems of 
managed care grows stronger. According to Shortell (1991), the success of 
hospitals and physicians in dealing with their ever more complex environments 
requires that both parties understand the complexities of that environment, 
that they understand each other's goals, values, and cultures, and that they 
develop an ongoing partnership built on trust and communication. 

Preferred provider organizations are defined as organizations of physicians 
and hospitals that contract with employers (or other purchasers, or insurers) 
to provide comprehensive health care services to enrollees on a fee-for-service 
basis (Cowan 1984). In the mid to late 1980s most PPOs were sponsored by 
hospitals and/or physician groups (53% of reporting PPOs in 1986), although 
insurer-sponsored PPOs have begun to have a significant presence (Dalton 
1987). PPOs have grown rapidly since the mid-1980s, primarily because they 
offer a structure for reducing health care costs without totally eliminating 
patient choice. Patients enrolled in PPOs are able to seek treatment from any 
physician or hospital within the PPO network, but unlike capitated and closed 
panel HMOs, the PPO preserves the traditional fee-for-servce payment 
method. The key to their cost-reduction capability, however, is two sided: 
they can combine negotiated fee discounts from member hospitals and 
physicians with a managed care strategy of service provision (Zwanziger & 
Auerbach 1991). 

Although numerous models of case management have appeared in both the 
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medical and social services literature (Merrill 1985), most medical case-man- 
agement programs focus on the recruitment of medical providers (physicians) 
who assume responsibility to provide or authorize (or both) a specified range 
of medical services to enrollees. Usually, enrollees relinquish their choice of 
provider after selecting a case manager, and the case manager then becomes 
the gatekeeper and sole point of access to services (Freund 1984, Hurley & 
Fennell 1990). The two central purposes of most medical case management 
programs are to assure enrollees' access to a stable, certain supply of medical 
services, and to impose oversight on the utilization patterns of enrollees. From 
the purchaser's (or third-party payor's) viewpoint, then, case management 
imposes an additional governance structure on the medical service transaction, 
by enlisting providers (primary care physicians) in the gatekeeper role. From 
the patient’s view, case managed systems may be quite different from 
traditional delivery systems, while from the provider's viewpoint, acceptance 
or animosity toward managed care may depend on specific program charac- 
teristics and conditions. Programs using third party brokers or intermediaries 
(commerical health management companies) may encounter considerable 
physician resistance (Kahn 1987). 

In the next sections we review studies that have attempted to analyze these 
recent trends from various theoretical perspectives. We begin with analyses 
of the determinants of major organizational change in medical care, and we 
consider three types of perspectives or theories: (1) strategic choice models, 
(ii) population dynamics, and (ii) change in technical and institutional 
environments. 


THE DETERMINANTS OF CHANGE 


Studies of change in US health care organizations, as with the study of 
organizational change in general, have usually adopted one of two basic 
models. Change is either conceptualized as the outcome of a rational, strategic 
process of decision-making in which the organization (or the healthcare 
system) actively chooses one course of action over another (usually in response 
to environmental threat), or change is the outcome of environmental selection 
processes that are outside of the control of any individual organization. In an 
early review of the literature on managing change in health care organizations, 
Kaluzny & Hernandez (1983) labeled these the rational and population ecology 
models. 

Conceptualizations of the environment have also evolved into two catego- 
ries of environmental constraints or characteristics: technical and institutional 
environments (Meyer & Scott 1983). Technical environments involve aspects 
of the organization's task environment related to the production and market 
exchange of goods and services; institutional environments encompass the 
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elaborate rules and requirements to which organizations must conform if they 
are to receive the social and political support and legitimation needed to 
survive. All organizations are subject to both technical and institutional 
constraints, although in varying levels of intensity. Historically, hospitals have 
experienced strong pressures from both contexts, given the technical demands 
of modern medicine and calls for cost and quality control, as well as the broad 
array of governmental regulations and requirements, professional expecta- 
tions, and certification processes (Alexander & Scott 1984). 

In this section we examine studies that have applied either strategic choice 
or population ecology frameworks to health care organizational change. We 
also review the extent to which technical and institutional constraints have 
been invoked as the impetus for change in this sector. 


Strategic Choice and Resource Dependence 


The notion of organizational change as strategic choice or adaptation has been 
a popular one in health care organizational research. It is a useful and relevant 
approach for health care administrators, in that the strategic choice model 
attempts to explain hospital behavior 1n terms of monitoring, anticipating, and 
responding to changes ın the environment (Kimberly & Zajac 1985). Through 
the early to mid 1980s strategic choice was espoused by a number of influential 
writers in health care, and calls for research multiplied (Shortell et al 1985, 
Bigelow & Mahon 1989). Several of the reorganizing trends discussed above 
have been studied as adaptive strategies to avoid closure or to survive in a 
turbulent environment. Examples include Alexander (1990), Bell & Bell 
(1991), and Giardina and colleagues (1990) on diversification, Mick & Conrad 
(1988) on vertical integration, and Halpern et al (1992) on rural hospital 
affiliation with multihospital systems. The Shortell framework (Shortell et al 
1985) has been considered by many to represent one of the most coherent 
statements of this perspective, offering a number of explicit research questions 
in need of research. 

Although the number of published articles on strategic choice and strategic 
management in health care organizations has mushroomed (see review by 
Shortell & Zajac 1990), the extent to which this research constitutes a coherent 
body of knowledge is questionable. An extensive meta-analysis of this 
literature was conducted by Topping & Hernandez (1991), covering 160 
articles organized into 18 research programs. In each of three central areas 
of strategy (strategy formulation, strategic content, and strategy implementa- 
tion/evaluation), Topping & Hernandez analyzed the extent to which available 
research constituted a coherent program of some scientific rigor. Rigor was 
defined as conceptual adequacy, methodological rigor, and empirical accu- 
mulation. The research programs on very few strategy topics were evaluated 
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as scientifically rigorous, particularly in studies of strategy formulation and 
implementation/evaluation. These authors concluded that too much of the 
strategy research in health care organizations tends to be prescriptive in nature, 
providing anecdotal evidence of past successes, with little conceptual or 
methodological ngor. There are still many unanswered questions about the 
process of strategy formulation and implementation, the conditionalizing 
effects of context and history, the impact of strategy on performance, and 
comparative analyses of alternative strategy options. 

One area of strategy research in which there has been renewed interest (one 
of the few exceptions to the above conclusions) 1s that of the role of hospital 
governing boards in the strategic management process. Governing boards have 
long been ignored in the hospital organizational literature, due to the post-war 
environment of growth and resource plenitude. In that environment, it was 
assumed that most hospital boards performed either a purely external linkage 
role (Pfeffer 1973), or a symbolic role (Starkweather 1988), and had little to 
do with long-range strategy formulation or day-to-day operation of the 
hospital. In the current post-PPS era, however, such a passive role is no longer 
defined as appropriate, and the performance and vitality of governing boards 
has become a topic of considerable interest to foundations and health care 
management associations (Kovner 1985). 

Recent work by Goodstein & Boeker (1991; and Boeker & Goodstein 1991) 
has resulted in empirical analyses of the impact of changes in management, 
ownership, and board composition on the process of strategic change. Unlike 
most strategic management studies, these authors examined strategic change 
over time, by examining changes in hospital services over a six-year period. 
The scope of hospital services was indeed influenced by changes in board 
composition and structure, independent of the influence of managerial change 
and ownership change. Another recent set of analyses has focused upon the 
interaction of board configuration and life-cycle stage on CEO turnover 
(Alexander et al 1993), and on major hospital reorganization (Fennell & 
Alexander 1989). 

The resource dependence framework also assumes the organization has 
some active role in responding to environmental influences but focuses 
specifically on the linkages developed between organizations in order to 
acquire needed resources. There is a limited literature on health care 
organizations that explicitly incorporates this perspective, and most of it has 
been sociological in focus rather than management-oriented. Provan has 
studied decision-making within multihospital consortiums (1984a) and pro- 
vider networks (1984b), using a resource dependence framework, and Fennell 
& Alexander have compared system and freestanding hospitals in strategies 
to either buffer or bridge in response to regulatory constraints (Fennell & 
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Alexander 1987). Both sets of studies confirm the utility of the framework in 
understanding when and where linkage development is an appropriate strategy. 


Population Ecology 


The population ecology framework uses a completely different perspective 
on what causes change in health care organizations, and a different level of 
analysis as well. Population ecologists focus on organizational populations, 
rather than on the actions of individual organizations, and they define 
environmental forces as the catalysts for all significant population level 
changes in form or type. The actions (or inactions) of managers are much 
less important in this perspective, and the timeframe for change is much 
longer. The study of population-level dynamics of organizational growth, 
variation, replacement, and survival unfolds over decades rather than years, 
and the need for systematic population-level longitudinal data is crucial. In 
the US medical care sector, population level changes over the past 25 years 
have been significant, and luckily, data on most organizational forms are 
available. 

Surprisingly, however, the number of empirical studies of health care 
organizations using a population ecology framework is somewhat small. In 
1987 a special issue of Medical Care Review took as its task to examine (and 
review) the structure and operations of health services organizations from the 
population ecology perspective. Five invited papers analyzed different health 
care organizations, including hospitals (Alexander & Amburgey 1987), 
hospital federations (D' Aunno & Zuckerman 1987), rural primary care centers 
(Ricketts et al 1987), and medical practice organizations (Rundall 1987). Each 
of these papers provided a revicw of current research on the organization in 
question and developed a number of research questions derived from 
population ecology, but analyses of data were limited to descriptive data on 
trends. Since that issue, very few empirical studies of the dynamics of change 
in populations of health care providers have appeared in print [see additional 
ecologically focused reviews on HMOs by Christianson et al (1991), and on 
the diffusion of magnetic resonance imaging by Renshaw and colleagues 
(1990)]. 

Both population ecology and strategic choice frameworks emphasize 
environmental change as a key determinant of organizational change. Al- 
though neither framework has been entirely successful in explaining the 
patterns reviewed in the previous section, a number of studies have made a 
contribution to each perspective by more carefully defining the dimensions 
of the health care environment that require attention. In particular, writers in 
the institutional perspective have developed the distinctive characteristics of 
technical and institutional environments that have acted as catalysts of change 
in the medical care sector. 
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As reviewed earlier, Meyer & Scott’s work has defined two categories of 
environmental! constraints, technical and institutional. Technical environments 
relate primarily to market and production concerns; institutional environments 
involve the social and political structures to which organizations must 
conform. Although both sets of constraints are of importance to medical care 
organizations, each category involves a variety of environmental influences. 
Scott (1987) provides a useful summary of institutional theory through the 
mid-1980s and traces the many variants that have developed, including 
multiple institutional environments or forces to which organizations must 
respond (DiMaggio & Powell 1983), such as public opinion, educational 
systems, legal structures, professions, ideologies and regulatory structures. 
Similarly, technical environments include demographic characteristics, tech- 
nological factors, and market structures. 

Rarely are both technical and institutional constraints considered jointly 
in research on medical care organizations. Most of this research has 
attempted to identify and measure a single environmental factor, or at most 
two, and traced the impact of change ın that factor on change in medical 
organizations. Further, only a few empirical studies of medical organizations 
have actually incorporated an institutional theory perspective (Fennell & 
Alexander 1987, Fennell et al 1987), and fewer still have considered more 
than one environmental factor. Another complication is the fact that 
environmental factors in the health industry are themselves subject to change, 
particularly regulatory programs and market conditions. Thus, the impact 
of one factor can change over time, and different factors can gain or lose 
influence over a relatively short period of time. For example, research by 
Wilke & Choi (1988) demonstrated that models to explain hospital acqui- 
sition by multihospital systems differed over a five-year period, underscoring 
the need for multiple models to capture the changing dynamics of the 
medical care environment. 

In this section of our review we briefly examine a number of studies, most 
of which focus on a single environmental factor. However, we organize this 
section into what we think are key attributes of both the technical and 
institutional environments of health care, and we hope that this categorization 
itself contributes to our understanding of environmental catalysts of change. 
Whether our level of analysis is the individual hospital (as in strategic choice) 
or the population of organizations (as in population ecology), we should be 
moving toward the development of change models that allow us to vary more 
than one environmental factor simultaneously, incorporate change in the 
environmental factors themselves, and allow us to consider the possibility of 
interaction between exogenous factors. 
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THE TECHNICAL ENVIRONMENT At a minimum, we suspect that three 
different aspects of the technical environment need to be considered in building 
models to explain change in med.cal care organizations: sociodemographic 
change, technological change, and market change. 

Among the most important demographic changes currently facing (and 
confusing) the US health care system are the increased proportion of elderly, 
and the changing nature of the AIDS epidemic. Each of these changes presents 
crucial contingencies for the delivery of medical care and the structure of 
medical care organizations, markets, and systems. 

The graying of the American population reflects both increased average 
longevity and the aging of the “baby boom” generations. Typical projections 
call for at least 15% of the US population to be age 65 or older by 2020. This 
IS the segment of the population that consumes more hospital and health care 
resources than any other. The implications of the aging population require an 
examination of trends toward hospital diversification into long-term care and 
geriatric services, and a focus on hospital discharge as the key entry point 
into the long-term care system, if not the coordination point for both acute 
and long-term care. Similarly, it is now evident that the AIDS epidemic 
constitutes a crisis for the hospital delivery system. The pressure it puts on 
health care resources has only begun to be felt, especially in inner-city 
hospitals of major urban areas. The need for AIDS-related hospital services 
represents a curious combination of high cost and low return to most hospitals, 
thus the burden of such services may ultimately be borne by inner-city 
religious and county hospitals—the types of hospitals with a historical 
commitment to care for the poor and stigmatized. Little research currently 
exists on hospital decisions to provide such services, although it is clearly a 
decision-arena where many different constituencies and agendas come to bear 
(LeBlanc & Hurley 1992). 

Another key aspect of the technical environment of health care organizations 
is, of course, the nature and cost of medical technology Progress 1n medical 
treatment and the development of new medical technologies has been the 
focus of several voluminous research traditions (none of which are reviewed 
here; see instead Greer 1986, Kanouse 1989, Van de Ven et al 1989). Medical 
technology is usually considered one of the handful of precipitating factors 
in the post-war growth of the medical industry and the ascendance of the 
hospital as the primary setting for medical treatment. In the more recent 
period, however, the continued growth of medical technology can only be 
analyzed in tandem with other significant environmental changes, namely the 
prospective payment system. PPS established a set of financial constraints on 
hospital reimbursement that has channeled the direction of medical innovations 
toward outpatient diagnosis. New medical technologies (such as MRI) often 
do not require hospitalization but are so expensive that hospitals are still the 
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most likely purchasers of such equipment. Now, however, it is unlikely that 
the hospital will be the only purchaser; physician groups, provider networks, 
and joint ventures among multiple organizations are investing in the new 
medical technologies (Renshaw et al 1990, Scott 1990, McKinney et al 1991). 
Finally, the structure of medical markets has changed considerably in the 
recent period, as a result of other environmental changes such as regulatory 
changes at the state and federal level, and the emergence of horizontal and 
vertical mergers and diversifications. Clearly, cross-sectional research is 
totally inadequate to disentangle these various interrelationships. A study of 
three communities in California from 1979 through 1984 by Starkweather & 
Carman (1987) provides a particularly good example of how markets evolve 
over time. In this study the focus of competition shifted from competition 
between individual hospitals to market control (through affiliation and 
merger), to market expansion and joint venturing with physician groups. This 
final stage could best be described as competition between health plans. 


THE INSTITUTIONAL ENVIRONMENT Three aspects of the institutional envi- 
ronment of medical care organizations are considered here: regulatory change, 
change in the medical profession, and change in the normative environment 
surrounding the medical sector. Regulatory change corresponds loosely to the 
process of coercive institutional isomorphism discussed by DiMaggio & 
Powell (1983), and changes in the medical profession and the normative 
environment reflect normative isomorphism. 

Clearly, the most significant regulatory change of the past decade was the 
implementation of the prospective payment system (PPS) for Medicare 
reimbursement in 1983. PPS radically changed the payment structure for 
hospitals from cost-based to prospective payment, in which Medicare pay- 
ments were made at a predetermined, specified rate for each discharge. 
Hospital costs above the predetermined rate are disallowed; thus incentives 
exist to meet or fall below the established rate. 

The enormity of the impact of PPS has been likened to an environmental 
Jolt or shift for the entire hospital industry (Zajac & Shortell 1989). In more 
general terms, PPS can be seen as symbolic of a third stage in the role of 
government in the financing and regulation of hospital services. As described 
by Scott & Lammers (1985), the federal government's role has shifted from 
that of the builder of hospitals in the post-war period, to the purchaser of 
services through the 1950s and 1960s, to the regulator and catalyst of cost 
containment ın the 1980s and onward. Others have described the more recent 
period as that of the "refederalization of American health care" (Rabe 1987). 

The impact of PPS 1s still unfolding, but recent studies have begun to 
document its influence on a number of dimensions of hospital behavior. The 
use of various treatment technologies within hospitals has shifted; this is 
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particularly evident in the decreased use of routine tests such as serology and 
blood chemistry (Sloan et al 1988), and the diversification into outpatient 
services as reviewed above. And, as expected, PPS was instrumental in 
reducing the growth of Medicare inpatient hospital expenditures (Chulis 
1991). However, the total level of national health care spending has not 
declined, and there has not been a decrease in the demand for care, but an 
increase in the demand for “Medigap” insurance plans, to insure health 
expenses not covered by Medicare (Manning et al 1987). 

Significant changes have also occurred in the regulatory programs oversee- 
ing physician fees and reimbursement. Early attempts to control physician 
fees are exemplified by state programs of selective contracting. More recently, 
however, Medicare physician payment reform has taken the form of the 
resource-based relative value scale (RBRVS), developed by Hsiao (Hsiao et 
al 1988), and implemented by the Health Care Financing Administration 
(HCFA) in 1992. RBRVS has been called the “Robin Hood” of physician 
payment reform, as it supposedly robs from the surgeon to give to the general 
practitioner by increasing pay for cognitive care (time spent with the patient 
in diagnosis and consultation) and decreasing payment for invasive care 
(surgery, invasive tests, etc). The expectations behind this reform are both to 
change practice patterns toward less invasive, less costly care, and in the long 
run, to increase the supply of primary care physicians. A secondary goal of 
RBRVS is to address geographic inequities between rural and urban physicians 
through the use of a geographic adjustment factor, which increases payments 
to rural physicians (Ginsberg et al 1990). 

The medical and health care professions have also changed considerably 
in the recent past, and those changes are both cause and consequence of the 
organizational changes we have discussed. Physicians, of course, continue to 
be both a major resource and a source of control over hospitals, placing them 
both in the technical and institutional environment. However, those connec- 
tions are no longer simple and no longer the only professional group of 
consequence to hospitals. À growing proportion of physicians in the United 
States are now linked financially with hospitals, through contracts, salary, or 
partnerships (Scott & Lammmers 1985). In addition, the number of allied 
health occupations has increased dramatically, and these are directly linked 
to hospital occupational identities resulting from increased complexity in 
medical technology (Rundall 1987, Scott & Backman 1990). Of particular 
importance, as well, is the rise in the number and importance of professional 
managers within medical care organizations. These individuals acquire 
generalized knowledge in management theory and skills, and apply them to 
the medical sector. As professionals, they carry with them a competing set 
of norms, values, and expectations concerning administrative efficiency and 
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management control, in direct conflict with traditional medical values of 
individual autonomy and physician dominance. 

Related to this issue of professional norms and values is the more general 
issue of change in the normative structure of the medical care sector. For 
much of the twentieth century, hospitals have been subject to a broad array 
of governmental regulations, professional certification procedures, and pro- 
fessional ideology. The general normative structure of medicine has always 
supported the dominance of the medical profession, the autonomy of the 
individual physician, and the greater value of care quality over care cost. A 
number of the developments in the medical sector we have reviewed here, 
however, suggest that the traditional normative structure of medicine has 
begun to erode. Alexander & D’ Aunno (1990) have suggested that institutional 
constraints in the medical sector have recently undergone “some degree of 
entropy and delegitimation” (p. 67), and that emphasis has shifted toward 
technical constraints that emphasize efficiency, cost concerns, and the 
corporatization of medical care. 

Obviously, there is much we still do not understand about the determinants 
of these dramatic changes in the medical care sector. Although we have begun 
to map and organize the various contextual changes that have both contributed 
to and interacted with the organizational trends of diversification, owner- 
ship/management reconfiguration, and interorganizational structures, our 
models have been too simple and too constrained by cross-sectional method- 
ologies, additive and linear thinking, and narrow research topics. 


THE CONSEQUENCES OF CHANGE: SUGGESTED 
DIRECTIONS 


In this section we put aside our review function and think instead about 
possible research directions, using the conceptual frameworks we have seen 
invoked sporadically in research on the determinants of change in this sector. 
As we shift to a focus on the consequences of these structural changes, we 
find that the field is nearly “uncharted territory." We do not presume to present 
here a completed map; rather our choices of topics and possible approaches 
are quite selective and, we hope, suggestive of other directions. Our 
suggestions are organized by level of analysis; first we consider organization- 
level consequences, and then system-level consequences. 


Organization-Level Consequences 


Defining the boundaries of a medical care organization is no longer a 
straightforward task. Ten years ago, the obvious target for an analysis of 
organizational performance in providing health/medical care would have been 
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the hospital or the practice group (Flood & Scott 1987, Kahn et al 1992). 
Now, however, our focus slides from freestanding hospitals (of which there 
are increasingly few to observe) to the multihospital system, or to the 
diversified health system or health plan. Not surprisingly, our traditional 
definitions of "organization-level consequences" are also inadequate. How 
does one measure performance (either in terms of cost or quality of care) of 
a diversified system? How does one gauge the consequences of system 
affiliation for individual organizational components? And what are patient- 
level consequences of radical change in local medical care organizations, 
either in terms of access, out-of-pocket health care costs, or care quality and 
continuity? 

We suspect that performance-type consequences cannot be carefully mod- 
eled without a clearer understanding of the structural changes themselves, 
particularly the impact of those changes on internal governance relationships 
(Fennell & Alexander 1989). Authority structures and accountability to 
various constituencies have become very complicated, almost slippery, 
phenomena. Individual CEOs may be held acccountable to both local boards 
and system headquarters, and health systems or plans may combine local and 
federated governance structures. Variation in authority relationships and 
governance structures is now extensive in the medical sector, and those 
structures both set goals and define organizational objectives at multiple 
levels. 

The issue of timeframe is once again crucial. Changes in internal gover- 
nance structures are both set off by, and contribute to, radical structural 
changes such as merger, affiliation, or corporate restructuring. Both of these 
"endogenous variables" are influenced by a variety of contextual and historical 
influences, including previous levels of performance and changes in the 
institutional and technical environments. The "consequences" of profound 
organizational changes, then, are entwined in a complicated set of simulta- 
neous feedback loops that are probably hopelessly underidentified and subject 
to autocorrelation. 

But perhaps not. À variant of ecological models in organization theory is 
the life-cycle approach (see reviews by Carroll 1984 and Whetten 1987), in 
which organizational change is conceptualized in terms of stages of growth, 
decline, stability and instability. Specific issues and problems tend to cluster 
within these stages, leading to predictions concerning the types of strategies 
or choices to be made by organizations to either move into or out of particular 
stages. In addition to "stage shifts," however, we have seen that health care 
organizations are subject to a variety of more instantaneous jolts or abrupt 
shifts, either externally induced (for example, the implementation of PPS) or 
internally decided (to merge, or diversify). It may be possible to model the 
relationships between change in governance structures, response to external 
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jolts, and performance consequences, conditioned upon organizational life 
cycle. Such models need to be dynamic in nature, and allow for multilevel 
measures of performance, both short- and long-term. Further, we need to 
track the influence of change-events on subsequent changes in performance 
and subsequent change-events. Clearly, what we suggest is not a single study, 
but a whole program of research. 

Another set of issues to be considered is that of power and conflict between 
multiple stakeholder groups connected in complex health care organizations, 
such as health systems or plans that combine physician groups with hospitals, 
clinics, and specialized diagnostic or rehabilitation facilities. As more medical 
care organizations develop into complex hierarchical structures, the likelihood 
of conflict is increased, and this type of organizaticnal setting becomes 
appropriate for the study of inter-professional group conflict. That conflict 
may focus on control over organization-based technologies, domain, agenda- 
setting, definition of the organization's culture or identity, or definition of 
ethical values (Scott & Backman 1990). Institutional theory provides a 
conceptual framework for examining how conflict over values between 
different provider groups can influence organizational decisions concerning 
service delivery profile, provider compensation, and accountability structures. 
An additional perspective on inter-professional group conflict is found in 
resource dependence. Within complex medical care organizations, the devel- 
opment of coalitions between groups can be predicted on the basis of 
dependency relations, either historically based or structurally induced. Those 
connections (or intra-organizational networks) can then influence agenda-set- 
ting, goal definition, and the definition of values and ethics within the 
organization. 


System-Level Consequences 


By system-level consequences we mean sector, industrv, or field, in contrast 
to our discussion of multihospital systems, diversified health systems, or plans 
in the previous section. Here we briefly outline several areas in which the 
consequences of change in medical care organizations transcend those 
organizations, regardless of boundary definition. These are consequences that 
can impact the health care sector in general, rather than specific organizations. 

Although the pre-1980s literature on the diffusion of medical technology 
emphasized the speed or rate of diffusion (with an implicit assumption that 
"faster" was better), the cost-containment impetus of the current health policy 
arena has changed the direction of this research. The diffusion of medical 
technology in the post-PPS era has actually become at least two different 
topics, with system-level repercussions: (1) the development and diffusion of 
new medical technologies that are not subject to the constraints of PPS, and 
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(ii) the diffusion of new organizational forms or nontraditional ownership 
arrangements for the purpose of technology adoption. 

To avoid the constraints of PPS, at least two strategies have been observed. 
Hospitals are exploring the extent to which technology purchases can be 
shifted to outpatient facilities, thus avoiding inpatient charge limitations. 
Thus, technologies that can be so shifted are increasingly preferred. Second, 
treatment strategies that are process or “software based” (Fennell & Warnecke 
1988), rather than equipment-based are also advantageous, and more likely 
to be considered in the post-PPS era. Finally, we have also seen the 
development and spread of new organizational forms for the specific purpose 
of technology adoption, which allow for cost-sharing and some degree of 
buffering from cost-constraint mechanisms. This topic has been studied by 
Kimberly and colleagues (see Renshaw et al 1990), using the case of magnetic 
resonance imaging. These authors developed an ecological analysis of 
variation in types of ownership forms to develop and survive over the long 
term. This is certainly an interesting and potentially fruitful approach, 
combining classic diffusion issues with population ecology principles of 
variation and selection. 

Both of these diffusion topics beg the question, however, of the system-level 
consequences of emphasizing certain types of technologies or developing 
particular organizational forms for technology adoption. Those consequences 
can be defined as both long-term/diffuse (i.e. how the structure and field of 
medical innovation will change as a result of an emphasis on outpatient 
delivery), and more immediate/patient-centered (how access, patient costs, 
and care quality will be affected). The first, although important and in need 
of policy discussion, is beyond the purview of this review. The second 
question is also embedded in policy issues, but concerns the type of per- 
formance indicators familiar to researchers in the health services arena. 

Other system-level consequences need to be explored using population 
ecology. The structural changes we have identified in this review constitute 
the development of wholely different forms of medical care organizations— 
new populations of providers. The ecological processes of selection, variation, 
and retention in the medical care sector are unfolding rapidly and are ripe for 
the types of analyses identified several years ago. Alexander & Amburgey 
(1987) identified several problems that have inhibited the use of ecological 
analysis in the medical sector, including the task of developing appropriate 
classifications of hospital and provider populations. The classificatory task 
has become quite daunting, given the proliferation of multitiered governance 
structures, and multiple-provider type organizations (such as PPOs, diversified 
health systems, and joint ventures of various types). The definition of 
boundaries of types of health care organizations has become infinitely more 
complex. But such complexity provides an even more compelling reason to 
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pursue ecological analyses: the changing shape of the health care sector 
requires a sector or population-level perspective in order to accurately describe 
and analyze the nature and extent of those changes. 

Finally, the transformation of the institutionalized norms and beliefs 
supporting the medical care sector constitutes another important topic in need 
of careful study. Institutional transformation is a lengthy process, and it does 
not occur uniformly or without conflict. It is itself a type of diffusion process 
eventually linking regulatory programs, professional groups, corporate struc- 
tures, and localized health systems. Alexander & D’Aunno have argued that 
incomplete institutionalization occurs when competing forms of rationality 
exist, such as the traditional medical model vs. the emergent corporate model 
(1990). Indeed, a further proliferation of different organizational forms is 
more likely once the transformation process has begun. and competing belief 
systems engender conflict and uncertainty. We still have much to learn about 
that transformation process, as it occurs and as 1ts consequences unfold. 
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Abstract 


In the late 1940s Sutherland proposed that explanations of deviance and crime 
are either situational or dispositional, and that of the two, situational - 
explanations might be the more important. Nonetheless, with a few notable 
exceptions, for the next four decades sociologists focused on dispositional 
theories to the near total exclusion of situational variables. However, an 
increasing awareness of the theoretical limitations of strategies based only on 
dispositions has begun to encourage researchers to reconsider situational 
explanations. Most of the research that explicitly examines situational 
dynamics in producing crime has originated in experimental psychology, 
symbolic interactionism, or opportunity theories. Experimental research has 
helped to identify the situational correlates of crime and deviance, but lacks 
a theoretical framework for organizing its disparate empirical findings. 
Symbolic interaction research has emphasized the actor’s role in defining and 
interpreting situations but thus far has not provided a theoretical link between 
motivation, opportunity, and crime. Opportunity theorists, especially those 
studying victimization, have made the most progress toward developing a 
situational theory of crime, but their emphasis on the victim rather than the 
offender imposes serious theoretical and methodological limitations. 
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114 BIRKBECK & LAFREE 
INTRODUCTION 


Hirschi & Gottfredson (1986:58) recently proposed an important distinction 
between the concepts of crime and criminality. Crime refers to events that 
presuppose a set of necessary conditions (e.g. activity, opportunity, adver- 
saries, victims, goods). Criminality refers to stable differences across indi- 
viduals in the propensity to commit criminal (or equivalent) acts. Hirschi & 
Gottfredson point out that criminality is a necessary but not sufficient condition 
for crime to occur, because crime requires situational inducements in the form 
of motivation and opportunity. Hence, the number of crimes individuals 
commit may show a weak relationship to their criminality. This type of 
conceptual distinction is not new. In 1947, Sutherland (p. 5) argued that 
explanations of crime are either “historical” or “situational.” Historical 
explanations focus on the processes operating in the offender’s previous 
history and are therefore what Hirschi & Gottfredson (1986) call theories of 
cnminality. Situational explanations refer to the processes operating at the 
moment of the crime’s occurrence: they are theories of crime. Sutherland even 
felt that situational explanation could be “a superior” crime explanation 
(Sutherland 1947:5). 

However, despite the acknowledged importance of the situation in explain- 
ing crime, the most influential theories of criminal behavior have focused on 
criminality (e.g. Merton 1938, Cohen 1955, Cloward & Ohlin 1960). 
Sutherland himself went on to propose his famous dispositional theory that 
conceives of criminality in terms of learned techniques and justifications. The 
reasons for this focus on criminality are no doubt complex, but three seem 
especially important. First, the sociological training of most criminologists 
leads them to think of the setting for crime in terms of broader social 
phenomena, for example, subcultures (Cohen 1955) and parenting styles 
(Hagan et al 1985), rather than situations. These social variables are more 
easily linked to criminality than crime. Second, the obvious fact that 
individuals sometimes respond differently to the same situation (Thomas 1937) 
has encouraged criminologists to relegate situational experience to a status 
dependent on dispositions (Sutherland 1947:5). Finally, the systematic exam- 
ination of situational variables is theoretically and methodologically complex, 
requiring the definition of key concepts, the development of conceptual 
models of the interaction between actors and situations, and the design of 
appropriate empirical research (Pervin 1981). 

Nevertheless, an increasing interest in crime (in contrast to criminality) and 
hence, greater concern with the situational aspects of crime, can be observed 
in diverse areas of criminology (Gibbons 1971, Petersen 1977, Clarke 1980, 
McCarthy & Hagan 1992). For example, experimental psychology has 
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produced a large body of research on the situational characteristics of violence 
and aggression (Geen 1990), dishonesty (Nettler 1982) and substance use 
(Deardoff et al 1975, Forsyth & Hundleby 1987). Moreover, symbolic 
interaction theory has been applied to various crime situations, especially 
those involving violent encounters (Luckenbill 1977, Athens 1980. Felson 
1981). Similarly, in the last 15 years opportunity theorists (Hindelang et al 
1978, Cohen & Felson 1979, Cohen et al 1981b) have developed a situational 
model of crime that has been widely tested. Although less systematic, other 
researchers have examined various situational aspects of deviant behavior, 
including the “journey to crime” (Harries 1980, Rhodes & Conly 1981) and 
offense planning (Petersilia et al 1978, Feeney 1986, Erez 1987). While 
representing diverse theoretical backgrounds and methodological approaches, 
all of these studies point to an emerging sense that explanations of crime and 
deviance must attend to the situation. 

The purpose of this essay is to review and evaluate some of the major 
findings from the literature on the situational dimensions of criminal and 
deviant behavior. Because the term “situation” is open to so many interpre- 
tations, we begin by providing a working definition for the study of crime 
and deviance. We then divide the literature on crime Situations into experi- 
mental and nonexperimental areas. Most of the experimenial research on crime 
is conducted by psychologists; most of the nonexperimental research is by 
sociologists. We further divide nonexperimental research into studies based 
on either symbolic interaction or opportunity theory. After summarizing and 
evaluating each of these research areas, we conclude with an assessment of 
their contribution to the explanation of crime, and we cffer suggestions for 
future research. 


THE CONCEPT OF THE SITUATION 


The term “situation” enjoys wide and varied usage in the social sciences 
(Goffman 1964, Jarvie 1972, van Doorne 1981, Farr 1985), but it generally 
refers to the immediate setting in which behavior occurs (Stebbins 1972, cf 
Moos 1973, Pervin 1978, Furnham & Argyle 1981, Magnusson 1981). Precise 
definition and operationalization of the situation are difficult, partly because 
of the need to include the actor’s subjective representation of the setting 
(Stebbins 1981), but mainly because of the causal complexity engendered by 
“many different things.. happening simultaneously—things that are likely to 
have begun at different moments and may terminate dissynchronously” 
(Goffman 1974:9). Researchers must define the objective and subjective 
content of situations and the point at which changes in content mean changes 
in situations. These decisions are difficult yet essential if situational studies 
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are to proceed with comparable units of analysis. Thus, the definition of 
situations is a continuing challenge for sociologists and psychologists (Argyle 
et al 1981). 

Pervin (1978) distinguishes between stimulus, situation, and environment. 
Stimulus refers to “the specific object of the organism’s attention or response 
pattern” (Pervin 1978:79). A stimulus may be an organism, place, or thing 
and usually has a brief existence—of seconds, rather than minutes. Situations 
are groups of stimuli, involving “an array of objects and actions which cover 
a time span” (Pervin 1978:79). They are defined by who is involved, what is 
going on, and where the action is taking place. When one of the components 
changes, so does the situation. For example, a person reading in an office is 
one kind of situation; if the person answers the telephone, the situation has 
changed. Situations may last a few seconds or several hours. Environments 
include “an organization of discrete situations and characteristics which may 
be continuous across situations but relevant to each of them” (Pervin 1978:80). 
Residences and workplaces are examples of environments. 

Studies of single stimuli in relation to criminal behavior are rare, because 
crime is usually a response to several stimuli that occur simultaneously and 
researchers are interested in exploring their joint effect on behavior. Most 
research has therefore concentrated on situations and environments. Because 
environments are more inclusive and of longer duration than situations, their 
potential influence on crime is more varied. For example, Straus et al (1980) 
found a significant correlation between the number of children in the home, 
which is an environmental variable (Monahan & Klassen, 1982), and violence 
against both spouse and children. However, although more children in the 
home may lead to higher rates of parental and spouse violence, acts of 
aggression nonetheless occur in specific situations, such as conflicts over 
appropriate behavior. Thus, at the analytical level criminal acts are most 
closely tied to situations. 

In contrast, environments can be linked to crime in two ways. First, they 
can be treated as aggregates of situations that are more, or less, likely to 
produce crime. For example, all else equal, more children in the home leads 
to more conflicts over appropriate behavior. Second, they can be considered 
as relatively short-term influences on the disposition to commit cnme. For 
example, progressive exhaustion generated by a large number of interactions 
with children may lead to a shift from verbal to physical tactics of parental 
control in the household. 

In this essay we adopt Pervin’s (1978) definition of the situation, indicating 
a level of analysis between stimuli and environments. We define situational 
analysis as the search for regularities in relationships between behavior and 
situations. 
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EXPERIMENTAL STUDIES OF CRIME AND DEVIANCE 


Without doubt, the largest amount of experimental research on crime and 
deviance has focused on aggression. We identify five situational correlates of 
aggression based on Argyle et al's (1981) review of the psychological research 
(see also, Wiggins 1983). The first is goal blocking and frustration. Dollard 
et al (1939:1) hypothesized that "aggression is always a consequence of 
frustration," a proposition which led to extensive testing and discussion (Geen 
1990). In an exhaustive review of frustration-aggression research, Berkowitz 
(1989:60) concluded that indeed, frustrations can contribute to human 
aggression under some conditions. Two examples of those conditions are 
frustrations that are perceived as intentionally induced (Averill 1982) and 
strong unfulfilled desires (Buss 1963). 

Second, physical or verbal attacks can lead to aggressive responses. For 
example, Borden et al (1971) found that many individuals strongly counter- 
attack when physically provoked, and Greenwell & Dengerink (1973) report 
that subjects respond aggressively to apparent intentions of attack. Geen 
(1968) also found that verbal attacks elicit aggressive behavior. 

Third, researchers (Bandura et al 1963, Berkowitz 1974, Geen & Thomas 
1986) report that the presence of aggressive models, particularly in the media, 
produces imitative behavior in some subjects. For example, Black & Bevan 
(1992) asked adults attending commercial showings of either a violent or 
nonviolent filn to complete two scales of the Buss-Durkee aggression 
inventory (Buss & Durkee 1957) before and after viewing the film. Audiences 
who chose the violent film had an initially higher level of aggressivity, which 
was heightened after seeing the film. 

Fourth, people or objects in situations may serve as cues for aggression. 
For example, Milgram's (1974) widely cited research on obedience to 
authority found that more than half the subjects in an experiment would 
administer apparently near-fatal electric shocks to another subject when told 
to do so by an experimenter. Similarly, Kelman & Hamilton (1989) have 
studied “crimes of obedience,” some of which are precipitated by the physical 
presence and instructions of authority figures. Using meta-analytic procedures, 
Carlson et al (1990) report that objects such as weapons, hostile bumper 
stickers, and clothing can function as cues for aggression. For example, in 
support of Berkowitz & LePage's (1967) original study, Boyanowsky & 
Griffiths (1982) found that the presence of weapons increased citizen 
aggression in encounters with police. 

Finally, situations vary in the level of restraint on aggression. Some 
situations may threaten physical or nonphysical punishment for the aggressive 
subject (Dollard et al 1939, Burnstein & Worchel 1962), often tied to social 
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rules about the appropriateness of aggressive behavior (Fox 1977, Marsh et 
al 1978). In others, a process of deindividuation (Zimbardo 1970) may occur, 
in which individuals lose their identity and become part of the group. Bartol 
(1991:110) suggests that deindividuation is an important ingredient in mass 
violence. 

Experimental research on dishonesty, which includes lying, cheating, and 
stealing, has been less concerned with identifying its situational correlates and 
more interested in examining the relative contributions of personality and 
situation to dishonest behavior. In their classic study of dishonesty among 
school children, Hartshorne & May (1928) unobtrusively observed 10,000 
subjects who were provided with several opportunities for lying, stealing, and 
cheating. Correlations of dishonest behavior for each subject across the 
different situations ranged from r = —.003 to r = .312. Hartshorne & May 
concluded that dishonesty springs from situational experience, not from any 
internal disposition or personality trait. 

In a widely cited reanalysis of Hartshorne & May’s data, Burton (1963; 
see also, Nelsen et al 1969, Schaie & Parham 1976) found evidence of an 
underlying trait of honesty, but argued for its conceptualization in terms of 
varying levels of situationally consistent behavior. Thus, some children may 
learn to be dishonest in many situations, others may learn to be dishonest in 
only a few. Dishonesty research largely reflects the concerns of personality 
psychologists, who have long debated the ments of trait (Allport 1937) versus 
situationist (Hartshorne & May 1928) models of behavior and finally moved 
toward an interactionist stance (Bowers 1973, Endler & Magnusson 1976, 
Magnusson & Endler 1977, Endler 1984, Kenrick & Funder 1988, Krahé 
1990) that views behavior as a function of both the person and the situation. 

Although less attention has been given to the situational correlates of 
dishonesty, some findings can be gleaned from the literature. Likely reward 
has been found to affect response in tempting situations. Thus, students are 
more likely to cheat if they expect poorer grades (Howell 1938, Johnson & 
Gormly 1972), and citizens are less likely to return lost envelopes the larger 
the amounts of money they contain (Farrington 1979). Risk of apprehension 
is an important consideration for many subjects. The level of student cheating 
is significantly affected by the obviousness of the measure used to assess 
cheating (Howell 1938, Johnson & Gormly 1972) and by more intensive 
surveillance (Corcoran & Rotter 1987, Covey et al 1989), while cash in lost 
envelopes is less likely to be returned than money orders (Farrington 1979). 
Finally, individuals also consider the ethical dimension of situations. 
Farrington (1979) found that envelopes with money were less likely to be 
returned to intended recipients that were considered undeserving. 

Experimental studies help identify the situational correlates of crime and 
deviance, such as frustration, threat, and reward. Furthermore, the research 
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literature shows similarities across deviance types. For example, the focus of 
research on restraints to aggression is similar to dishonesty research on risk 
of apprehension and rules governing the use of dishonest behavior. However, 
current knowledge is not systematically organized. Thus, compared to 
aggression researchers, dishonesty researchers pay more attention to reward; 
compared to dishonesty researchers, aggression researchers pay more attention 
to the role of models in producing imitative behavior. 

Experimental methods used to study situational aspects of crime and 
deviance also have several limitations. First, most experimental studies 
exclude data on subjects’ interpretations of situations (Alexander & Knight 
1971). In the absence of data on interpretations, even similar responses by 
subjects cannot be attributed to similar interpretations of situations. Second, 
much experimental research imposes situations on subjects, in marked contrast 
to the natural tendency of people to take an active role in selecting and creating 
the situations they enter (Wachtel 1973). Thus, even if there were uniform 
responses to situations, the prediction of crime would require an understanding 
of the ways in which crime-inducing situations are created. Third, Furnham 
& Argyle (1981:xxiv) list several ways in which laboratory experiments 
produce artefactual results, including experimenter effects, demand charac- 
teristics, response sets, and role playing. Finally, ethical considerations often 
limit the experimental method to the study of less serious forms of crime and 
deviance (Farrington 1979). 


SYMBOLIC INTERACTIONIST STUDIES OF CRIME 
AND DEVIANCE 


Symbolic interactionists have contributed to situational crime perspectives in 
two main ways. Most basically, like experimental psychologists and oppor- 
tunity theorists, interactionists (e.g. Cooley 1922, Thomas 1928, Mead 1934, 
Goffman 1959, Shibutani 1961, Blumer 1969) emphasize the obvious fact 
that all human action takes place in situations. Thus, rather than seek data 
from phenomena that are temporally and spatially distant from the empirical 
acts under study, symbolic interactionists assert that researchers must attend 
to individuals as they deal with the succession of situations actually confront- 
ing them. 

Second, unlike experimental psychologists and opportunity theorists, sym- 
bolic interactionists emphasize that situations are given meaning only through 
the subjective experiences of actors. Actors select, weigh, check, suspend, 
and transform the meanings of the situations they encounter. Interpretation is 
not an automatic application of previously established meanings, but rather a 
formative process in which new meanings are continuously developed and 
revised to guide behavior. Thus, rather than treat humans as neutral mediums 
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through which antecedent variables express themselves, symbolic inter- 
actionists assume that the key to understanding social behavior is found in 
the processes by which individuals actively interpret situations. 

These two elements—the objective fact of situations and their subjective 
definition by actors—are both important. To ignore the objective character- 
istics of situations (as most criminologists do) is to 1gnore the fact that apart 
from the psychological and social characteristics of individuals, situations also 
vary in terms of their ability to produce crime or deviance: thus, a large 
proportion of Milgram's (1974) subjects were willing to administer shocks; 
violent crimes are clearly concentrated in particular situations (Wolfgang 
- 1958, Amir 1971, Conklin 1972). But to ignore the processes by which actors 
subjectively interpret situations (as experimental psychologists and opportu- 
nity theorists generally do), is to ignore the fact that the same objective 
situation does not always lead to identical behavior: many of Milgram's 
subjects did not administer shocks; criminogenic situations do not always 
produce crime. 

Despite the centrality of situated behavior for interactionist theory, not all 
deviance research inspired by the perspective attends to situational variables. 
For example, most research on the link between self image and delinquency 
(Schwartz & Stryker 1970, Hagan & Palloni 1990, Matsueda 1992) concen- 
trates on the formation of self, but does not consider the characteristics of 
individual crime or deviance situations. Thus, Matsueda (1992:1581) argues 
that a "stable self" arises because the self-images called up in a particular 
situation resemble those called up from similar past situations. This approach 
has the substantial methodological advantage of holding the situation constant 
by assuming that it has no effect on behavior. Nonetheless, it is clear that 
regardless of how "stable" the self is, situations remain critical for the 
prediction of crime and especially, for specific forms of it. If this were not 
so, studies of the self would provide far more accurate prediction than they 
do. 

In recent years, symbolic interactionists (or those influenced by interaction- 
ist perspectives) have examined a variety of crime and deviance situations, 
including murder (Luckenbill 1977, Lejeune 1977, Athens, 1980, 1989, Katz 
1988), rape (Scully & Marolla 1985, Felson 1993), robbery (Letkemann 1973, 
Luckenbill 1981, Katz 1988, 1991), family violence (Denzin 1984, Loseke 
& Cahill 1984), pornography (Davis 1983), minor property offenses (Katz 
1988), drug use (Goode 1989), extramarital affairs (Atwater 1982), and gang 
membership (Katz 1988). Symbolic interaction theory contributes to a 
situational analysis of crime and deviance by focusing attention on the meaning 
of situations for actors, by linking offender motivation and situational 
opportunities, and by conceptualizing crime and deviance as situationally 
precarious outcomes. 
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Katz’s (1988) research illustrates the importance of attending to the actor’s 
subjective experience of deviant and criminal situations. His central assump- 
tion is that crime cannot be explained without a close examination of the 
situationally specific experiences of those who commit it. Katz studied the 
“seductive qualities” (1988:3) of crime and in so doing identified expressive 
dimensions of the crime event, above and beyond its instrumental functions. 
Thus, shoplifting and vandalism furnish “sneaky thrills” to adolescents (cf 
Matza & Sykes 1961, Atwater 1982, O’Malley & Mugford 1991); robbery 
contributes to the offender's self image as a "hardman" (Athens 1980, Katz 
1988). Katz emphasizes the ability of these and other crimes to create moral 
transcendence and even righteousness out of boredom, humiliation, and chaos. 
By paying close attention to the subjective meaning of crime and deviance 
for participants, he demonstrates that the goals of offenders are at once more 
contradictory, less well developed, and more complex than researchers have 
commonly assumed. 

Another important contribution of symbolic interaction theorists is their 
insistence that the motivation of offenders and the opportunity for crime and 
deviance are inextricably linked. Robbery provides a convenient example. 
Most researchers (Long & Witte 1981, Blau & Blau 1982, Chiricos 1987) 
assume that the motives for robbery are instrumental and hence, robbery rates 
should be responsive to economic measures such as inequality, unemploy- 
ment, and poverty, and to situational contingencies reflecting probable 
rewards and the risk of apprehension. But Katz's description (1988, 1991) of 
the realities of robbery situations makes these economic explanations seem 
suspect: robbers frequently victimize people they know, close to their home 
(making identification easier); most robberies involve little advance planning; 
offenders are frequently intoxicated or under the influence of drugs when they 
commit the crime; the typical robbery results in a relatively small profit, most 
of which is rapidly consumed; most offenders suffer frequent arrests, and if 
they persist, almost certainly spend long amounts of time in prison (cf 
Gottfredson & Hirschi 1990). Thus, the opportunities that offenders perceive 
for robbery depend on the subjective emphasis they place on expressive as 
well as instrumental dimensions. 

Symbolic interaction theory is also important for demonstrating the situa- 
tionally precarious nature of crime and deviance. Most studies of criminality 
treat crime as a foregone conclusion that either happens or does not happen. 
In contrast, interactionist research illustrates the often complex processes 
required to successfully commit crime or deviance. Thus, Luckenbill 
(1980:365) shows that the successful robbery requires a fundamental shift in 
the definition of the situation from one involving “routine. relatively tranquil” 
interaction to one involving robbery. The success of this attempt is by no 
means a foregone conclusion. Indeed, Katz (1988, 1991) describes in detail 
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how difficult it is to complete robbery successfully. Similarly, LeJeune & 
Alex (1973), Felson (1981), and Campbell & Gibbs (1986) stress the 
importance of "impression management" in successfully conducting violent 
transactions. 

In short, crime requires situational skills, and their level of development 
varies between individuals. Furthermore, offenders often compare their skills 
to the demands of the situation. For example, Athens (1980) asked convicted 
offenders to describe the situations leading up to their current incarceration 
and the situations 1n which they almost, but did not actually, commit violent 
crimes. He found that, because of the risks of apprehension, punishment, or 
social disapproval, individuals exercise "restraining judgments" (1980:31) on 
their violent intentions in many situations. 

Interactionist studies such as these demonstrate the complexity of situations, 
illustrate the consequences of adopting simplistic assumptions about situa- 
tions, and thereby clear the way for theoretical and methodological approaches 
to crime that examine the offender as a situationally grounded actor. Given 
the complexities of interaction between actors and situations, interactionists 
have generally rejected deductive, statistical analysis in favor of inductive, 
qualitative approaches. Interactionists (Blumer 1969, Athens 1980) are of 
course well aware of the methodological difficulties raised by the perspective 
and view it as a necessary cost of producing a theory that is more in tune 
with the world of lived experience. Nonetheless, the most serious limitations 
of symbolic interaction theory are those directly related to research methods. 
In particular, interactionist contributions are limited by their largely descrip- 
tive results, difficulties in measuring Situational variables and low 
generalizability. 

Interactionists’ reliance on descriptive methods sometimes produces results 
that are at a low level of abstraction and that make theoretical generalizations 
difficult. For example, one of Athens’ (1980:81) major conclusions is that 
“actors commit violent criminal acts only after they form violent interpreta- 
tions of the situations which confront them." While this conclusion may be 
empirically true, it borders on being trivial. Similarly, Katz's (1988:9) 
inductive approach leads him to develop “a different set of individually 
necessary and jointly sufficient conditions" for each type of crime and 
deviance he examines. This method of examining each crime separately makes 
it difficult for Katz to identify similarities between crimes. More generally, 
the task of developing a general theory of crime becomes impossible if, in 
Gottfredson & Hirschi's (1990:171) words, we need a separate “criminology 
for street crimes and another for white collar crimes, one criminology for 
murder and another for robbery." 

The methodological challenges of interactionist approaches are further 
exacerbated by the peculiar difficulties of studying crime and deviance. If 
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research is to move beyond description, researchers must be interested as 
much in situations that do not produce deviance as those that do. However, 
while it is feasible to study situations that result in crime—at least if offenders 
leave behind arrest or conviction records or are willing to self report—it is 
much more complex to gather information on the myriad situations that could 
have but did not result ın crime. For example, based on his analysis, Katz 
(1988, p. 5) argues that we must recognize the "sensual dynamics" of crime. 
But because Katz does not simultaneously examine situations that do not result 
in crime, his conclusion has a limited ability to predict crime: Do persons 
who do not engage in deviance also experience these dynamics? Are they 
more typical of deviant than nondeviant behavior? Are actors who do not 
commit deviance sometimes repulsed by their nondeviant behavior? With no 
data on situations that do not lead to deviance, answers to these and similar 
questions are impossible. 

Finally, and largely as a consequence of the foregoing, interactionist 
research has had low generalizability. The methods employed and the 
interpretation of results are so different as to make each study unique. Thus, 
Athens (1980) relies on a nonrandom sample of convicted offenders and his 
own firsthand observations; Katz (1988) relies on interviews, observations, 
and journalistic accounts. 

Despite their methodological and theoretical difficulties, interactionist 
studies have played an important role in bringing situations into the study of 
crime and deviance. Moreover, the interactionist emphasis on the actor's role 
in defining and interpreting situations serves as an important critique of 
research that either ignores offender motivation altogether or assumes that it 
always reflects instrumental goals. 


OPPORTUNITY THEORIES OF CRIME AND DEVIANCE 


The opportunity perspective in criminology is concerned with the incidence 
and location of crime events in social systems, and its theories are based on 
the premise that some situations are more favorable for crime than others. 
The origins of these theories are diverse, ranging from the study of 
victimization survey data (Hindelang et al 1978), to the application of human 
ecology (Cohen & Felson 1979), rational choice (Cornish & Clarke 1986) 
and economic (Cook 1986) perspectives. Nevertheless, a common element in 
all is an image of crime as a situated event. As Cohen & Felson (1979:589) 
expressed it, crime occurs when motivated offenders converge (in time and 
space) with suitable targets in the absence of capable guardians. 

Two kinds of opportunity theory have emerged. In one, which we call 
"situational selection," the focus is on the decisions made by offenders in 
choosing situations for crime. In the other, which we call "theories of 
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victimization,” the focus is on the characteristics and activities of individuals 
that contribute to their victimization. 


Theories of Situational Selection 


The term situational selection refers to the appraisal of a situation as suitable 
for crime. Some appraisals may be conducted rapidly, while others may 
require substantial time. Some appraisals follow the decision to commit a 
crime and thereby involve the search for the right situation, other appraisals 
actually precede the decision to commit a crime and involve the evaluation 
of only one situation (cf Maguire 1980, Bennett & Wright 1984). 

Numerous empirical studies demonstrate the existence of situational selec- 
tion by offenders. For example, McIntosh's (1971, 1975) historical study of 
the organization of crime in England shows that specific forms of crime such 
as sneak thieving are tied to the emergence of densely populated, relatively 
anonymous urban situations. Similarly, ethnographic literature on offenders 
(Chambliss 1972, Letkemann 1973) and systematic studies of offender 
decision-making help identify the role of situational selection in the genesis 
of crime. For example, Hough (1987) analyzed British Crime Survey data 
and found that burglars' choices of residences are partly influenced by the 
potential yield, household occupancy, and the availability and location of 
entry points (cf Cromwell et al 1991). Bennett & Wright (1984) obtained 
similar results using interviews and experimental techniques in a study of 
decision-making among a British sample of convicted burglars. 

Two important questions concerning situational selection are the extent to 
which it enters decision-making by offenders and the criteria of selection that 
offenders employ. Researchers have reached varying conclusions when they 
have studied the extent to which offenders plan their crimes. For example, 
Reppetto (1974) found evidence of planning among 75% of a sample of 
convicted burglars. Petersilia et al (1978) found that about 2596 of a sample 
of incarcerated armed robbers planned in detail, 50% planned some aspects 
of the crime, and 25% did not plan at all. Walsh (1986) found that 52% of 
a sample of British armed robbers planned the crime, but 1n a US sample 
Feeney (1986) found that only 15% planned their robberies. This variability 
undoubtedly derives from different sampling strategies, the focus on different 
crimes, different definitions of planning (none are made explicit), and different 
data collection methods. 

Moreover, the concepts of planning and situational selection differ in 
several important ways. First, planning implies premeditation, whereas some 
situational selection 1s not premeditated (Luckenbill 1981, Scully & Marolla 
1985). Second, planning can include more than situational selection such as, 
for example, the organization of several participants (Letkemann 1973). Third, 
planning implies deliberate consideration of the steps involved in committing 
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crime, whereas situational selection may be rapid and informal. Nevertheless, 
existing opportunity research is useful in suggesting that situational selection 
is likely to be more extensive among older offenders (Reppetto 1974), less 
extensive when the offender is impulsive (Letkemann 1973), and more rapid 
when the offender is experienced (Carroll & Weaver 1986). 

Empirical research also helps to identify some of the criteria employed by 
offenders in selecting situations. In general, ease of access to the target (Scarr 
1973, Brantingham & Brantingham 1975, Molumby 1976), risk of being 
observed or caught (Bennett & Wright 1984, Walsh 1986), and expected 
reward (Hough 1987) are considered by offenders. Another criterion is the 
ethical nature of the crime situation. For example, LaBeff et al (1990) found 
that college students develop situationally specific judgments about the 
appropriateness of cheating (cf Sykes & Matza 1957, Fletcher 1966). Research 
has also found that risk and accessibility are more prominent than probable 
reward in the decision-making process (Bennett & Wright 1984, Carroll & 
Weaver 1986, Gabor et al 1987). 

Several researchers have attempted to develop more formal propositions 
concerning the process of situational selection. Reppetto’s (1976) displace- 
ment hypothesis proposes that offenders may change the location, timing, 
target, tactic, or even type of crime in response to unfavorable changes in the 
situations usually chosen for crime (Cornish & Clarke 1987, Barr & Pease 
1990, Gabor 1990). Brantingham & Brantingham (1984) propose a theoretical 
model of crime site selection that integrates the findings of prior research on 
target selection. The model assumes the existence of motivated offenders, 
who use situational cues to locate suitable targets for crime. Rational choice 
theory (Cornish & Clarke 1986) envisions the selection of crime targets as a 
function of situational factors relating to opportunity, effort, and risk. An 
important premise in the rational choice perspective is that crime types differ 
in the way these situational factors affect the decision-making process. Finally, 
LaFree & Birkbeck (1991) developed a partial model of situational selection, 
incorporating the crime technique, the offender’s goals, and the situational 
characteristics of crime. Cross-national data presented by the authors largely 
confirm their hypotheses. 

To summarize, studies of situational selection have made a number of 
contributions to our understanding of crime. First, they demonstrate that 
offenders’ decisions are partially determined by situational contingencies. 
Second, they indicate the dimensions of situations that are frequently evaluated 
by offenders. Third, they show that the type of situation commonly chosen 
by offenders varies by crime type. Finally. they suggest that the level of 
attention to situational selection varies by offender and crime type. Future 
research on the nature and role of situational selection would benefit greatly 
from the conceptual integration of the various theoretical perspectives, and 
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the strengthening of links between theory and empirical research. This would 
provide a clearer understanding of offenders as situational actors and crimes 
as situated events. 


Theories of Victimization 


The most popular theory of victimization is Cohen et al’s (Cohen & Felson 
1979, Cohen et al 1981b) routine activity theory, considered one of the most 
important new perspectives in criminology (Pearson & Weiner 1985, Gibbons 
1985). Hindelang et al’s (1978, Garofalo 1987) lifestyle theory is a similar 
statement (see also Cook 1986). Congruent with theories of victimization, 
although considerably less sophisticated, is Jeffery’s (1971, Mayhew 1979) 
concept of defensible space. 

Theories of victimization reflect the provocative theme that individuals 
contribute to their own victimization (von Hentig 1948, Jensen & Brownfield 
1986, Fattah 1991). However, it is not the active contribution implied by 
victim-precipitation (Wolfgang 1958), but a causal link between the charac- 
teristics and activities of individuals and their rate of victimization. This link 
is established by employing a situational model of criminal behavior. 

Routine activity theory illustrates the typical approach. The dependent 
variable is the victimization rate, which can be measured longitudinally 
(Cohen et al 1980) or cross-sectionally for spatial units (Messner & Blau 
1987, Bennett 1991) and subsamples (Cohen et al 1981b). Researchers 
hypothesize that victimization rates are a function of risk factors, which refer 
to the situational interaction of potential targets and motivated offenders. 

The most commonly cited risk factors are exposure, guardianship, and 
attractiveness. Cohen et al (1981b:507—8) define exposure as “the physical 
visibility and accessibility of persons or objects to potential offenders at any 
given time or place,” guardianship as “the effectiveness of persons or objects 
in preventing violations from occurring,” and attractiveness as “the material 
or symbolic desirability of persons or property targets to potential offenders.” 

Risk factors appear as intervening variables in routine activity theory 
because they are assumed to be a function of target variables that describe 
potential victims. Contextual target variables generally refer to residential 
communities. Typical examples are residential housing density (Massey & 
McKean 1985, Sampson & Wooldredge 1987) and level of policing (Stahura 
& Sloan 1988). Individual target variables refer to sociodemographic charac- 
teristics such as gender, age, and income, or to activities such as employment, 
family status, or recreation. Sociodemographic variables are hypothesized to 
exert direct effects on some risk factors, but indirect effects on others, 
mediated by activity variables. 

The main hypothesis of routine activity theory is that targets are more likely 
to be victimized when they are more attractive, less well guarded, and more 
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frequently exposed to motivated offenders. This hypothesis has typically been 
tested with victimization survey data (e.g. Gottfredson 1984). These usually 
provide sociodemographic information on victims and nonvictims, basic 
information on the circumstances of the crime, and cccasionally (as in the 
US National Crime Survey— Victim Risk Supplement, 1983), information on 
activity patterns. Other studies have used police data (e.g. Messner & Tardiff 
1985). 

Initial tests of routine activity theory were largely supportive. For example, 
Cohen and associates found support for the theory in several studies of 
personal and property crime (Cohen & Felson 1979, Cohen et al 1980, Cohen 
et al 1981a, 1981b, Felson & Cohen 1981). Since then, other researchers 
have also reported confirmatory findings (e.g. Maxfield 1987). This would 
seem to validate the situational model that is at the core of routine activity 
theory and to demonstrate its utihty for explaining crime rates. However, 
several problems with the theory have sparked additional debate and research. 

First, the use of sociodemographic or activity variables as proxy measures 
for risk factors has often been based on ambiguous assumptions. In the absence 
of data, these relationships can be formulated to support any empirical 
outcome and the theory becomes unfalsifiable (Miethe et al 1987). For 
example, Cohen et al (1981b:511) claim that high income 1s associated with 
greater guardianship because more protection can be purchased and because 
marriage is (economically) more feasible and places the individual in greater 
contact with significant others. By contrast, they argue that young people are 
less likely to enjoy guardianship because they spend more time outside the 
home than older people and hence are more accessible to offenders. But the 
fact that different dimensions of guardianship are emphasized for each group 
leaves room for contradictions. For example, because higher income may well 
be associated with greater levels of activity outside the home, compared to 
low income groups, high income groups may be more accessible to offenders; 
and because compared to older people, younger people may be more 
frequently in the company of significant others, they may be better guarded. 

Second, when more direct measures of risk factors are used, risk variables 
are not uniformly related to victimization rates. For example, Miethe & Meier 
(1990) examined data from the British Crime Survey and found that proximity 
and exposure variables are more consistently related to victimization than 
attractiveness and guardianship vanables. Similarly, using data on property 
victimization in Atlanta, Massey et al (1989) found that one measure of 
exposure to offenders and one of target suitability were significantly associated 
with victimization risk, although more refined measures of guardianship were 
not. Massey et al conclude (1989:396) that "support for the routine activities 
approach is weak at best" (cf Miethe et al 1991). 

Finally, many tests of routine activity theory employ broadly defined classes 
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of crime that are situationally heterogeneous and unlikely to be explained by 
one activity model (Lynch 1987). For example, Miethe et al (1987) test for 
relations between the frequency of nighttime excursions for entertainment and 
the victimization rate for violent and property crimes irrespective of their 
timing and location. The problem is that some crimes at night do not occur 
while people are out on nighttime excursions (e.g. they may take place in the 
home). Thus, Miethe et al use a measure of risk that is conceptually linked 
only to a subset of victimizations included in the model. As Lynch points 
out, the explanatory power of such models is likely to be low. He suggests 
that domain-specific models of victimization be developed (cf Cohen & Felson 
1979). r 

From the perspective of sıtuational analysis, several additional criticisms 
of victimization theories become clear. Most of these are directly related to 
the decision by victimization theorists (Cohen & Felson 1979, Cohen & Land 
1987) to emphasize victim rather than offender behavior to explain crime. 
First, theories of victimization often posit simplistic models of situational 
selection. For example, routine activity theory rarely allows for differences 
between offenders in the level of attention to situational selection (Miethe et 
al 1987). It treats access to the target, risk of detection and apprehension, and 
likely reward as equal criteria, and the only criteria, employed by offenders 
in situational selection. And it pays little attention to situational differences 
between crime types. As a result, risk factors may be defined and measured 
inappropriately, and their role in explaining victimization may be unclear. 

Second, victims are rarely an accurate source of data on the process of 
situational selection by offenders. Victims can often describe situations in 
which they were victimized, but they cannot know if other situations and 
targets were also appraised by offenders and if so, with what criteria, or 
whether they have been appraised as potential victims in the past and not 
selected. Accurate information on the interaction between potential offenders 
and victims is important for testing victimization theories, but it cannot be 
supplied by respondents in victim surveys. 

Finally, because offenders, not victims, commit crimes, statements about 
the risk of victimization are merely restatements of hypotheses about 
situational selection by offenders. Thus, hypotheses about the probability of 
victimization can be trivial (Gottfredson 1981). For example, purse snatching 
occurs mainly in crowded public places and involves single victims who are 
generally easier to victimize (e.g. the old, the young, women—LaFree & 
Birkbeck 1991). Hence, the specification of exposure to snatching requires a 
simple repetition of these elements, that is, the more frequently that the old, 
the young, or women are in crowded public places, the greater is their exposure 
to snatching. Moreover, the focus on the victim leads to the restatement of 
such hypotheses in terms of specific victim or target characteristics, for 
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example, daily activities or target hardening. Rather than encourage concep- 
tual abstraction, theories of victimization lead in the opposite direction, and 
as Skogan (1981:735) points out: “There can be no science of door locking 
and property marking.” 

To summarize, victimization theorists have attempted to explain patterns 
of victimization based on a situational model of offender behavior not entirely 
supported by studies of situational selection. Further progress in this area is 
more likely to be achieved by focusing on offenders’ decision-making in 
selecting situations rather than through examining the characteristics and 
activities of victims. 


DISCUSSION AND CONCLUSION 


Our objective in this essay has been to review the three research literatures 
that have been most active in conceptualizing and measuring situational 
dimensions of crime and deviance. Experimental, symbolic interactionist, and 
opportunity research have largely developed independently of each other, 
although they have a common interest in situational analysis. In this section 
we examine the contributions of these three perspectives to an understanding 
of crime and discuss some implications forfurther research. We consider two 
issues: the general role of situations in the explanation of crime, and the 
specific contributions of situational analysis to the understanding of motivation 
and opportunity. 


Situations and the Explanation of Crime 


Situational analysis in criminology focuses on the crime-producing effects of 
physical and social stimuli captured by individuals from the immediate setting. 
Our review suggests that researchers interested in situational analysis have 
developed two perspectives on the role of situations in explaining crime. First, 
many experimental researchers on aggression and opportunity theorists treat 
the situation as an objective entity. They concentrate on the identification of 
situational variables that exert a similar influence on broad classes of 
individuals. Second, many experimental researchers on dishonesty and 
symbolic interactionists treat the situation as a subjective creation of the 
individual, constructed from the selective perception and interpretation of the 
immediate setting. 

We argue that the first approach is useful for identifying important 
situational influences on behavior but is nevertheless of more limited utility 
than the second. This 1s because, as Mischel (1973) points out, situations vary 
in the extent to which they constrain behavioral choices. Thus, “weak” 
situations may produce different reactions among subjects because of differing 
perceptions (Stebbins 1981), evaluations of appropriate behavior, predictions 
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of outcomes, and levels of self-regulation (Mischel 1973). Studies that focus 
on the objective characteristics of situations are therefore likely to produce 
limited generalizations about crime. 

The second approach suggests that the immediate setting is only relevant 
to understanding crime when examined from the offender’s perspective. The 
main contribution of this research is not to demonstrate the overwhelming 
influence of situations on behavior, but to focus attention on the role of 
situational variables in offender decision-making. Thus, the offender continues 
to play a central role in the production of crime. 

The second approach might initially appear to be nothing more than a return 
to criminology’s traditional emphasis on offenders, but that is not the case. 
Compared to other studies of offenders, situational analysis pays significantly 
more attention to offender decision-making, to variables that are spatio-tem- 
porally proximate to the empirical acts under study, and to the role of 
opportunities. The main contribution of situational analysis to criminology 
involves the identification of patterns of interaction between individuals and 
situations in the genesis of decisions to commit crime. 

Psychologists may make some important contributions here. The inter- 
actionist debate in psychology has stimulated the development of several new 
methods for studying the situational dimensions of behavior (Krahé 1990), 
involving quantitative studies of the interaction between person and situation 
variables (Malloy & Kenny 1986, Ozer 1986), studies of the cognitive 
representation of situations (Forgas 1979) and individual-centered strategies 
(Buss & Craik 1983, Bem & Funder 1978). Their importance lies in the 
attempt to conceptualize personality in terms of the individual’s mode of 
interaction with situations; they treat the individual as a situational actor. 


Situations, Motivation, and Opportunity 


Situational analysis suggests that situations affect crime in two main ways. 
First, experimental and symbolic interactionist studies emphasize the role of 
Situations in motivating individuals to commit crime by imposing negative 
experiences such as frustration, threats, humiliation, and boredom; by offering 
positive attractions such as money, property, image-building, thrills and 
sexual satisfaction; or by providing models to be imitated. Second, theories 
of situational selection and victimization emphasize the role of situations in 
affecting the extent to which criminal motivations can be realized. Situations 
provide varying rewards and punishments, probabilities of success, and ethical 
consequences for an offender with a given objective, skill level, and moral 
outlook. This second dimension of situational influence is generally referred 
to as opportunity. 

Experience suggests that criminologists have found it easier to conceive of 
situations as opportunities for crime than as motivating factors. The distinction 


SITUATIONAL ANALYSIS 131 


between criminality and crime, which is essentially a strategy to separate 
motivation and opportunity, 1s found in many criminological theories (Suth- 
, erland 1947, Coleman 1987) and is an explicit premise of opportunity theories. 
As we have seen, the latter simply assume that motivation exists and center 
attention on the role of opportunities in crime. 

Thus, one task for future research is to examine patterns of interaction 
between offenders and situations in the genesis of opportunities for crime. 
For example, Mischel (1973:275) has identified five variables that characterize 
the individual's interaction with situations: (i) behavioral construction com- 
petencies, (ii) encoding and categorization of situations, (iti) behavior-out- 
come expectancies, (iv) subjective evaluations of expected outcomes, and (v) 
self-regulatory systems and plans. These variables could be studied in relation 
to the individual's identification of appropriate opportunities for crime. They 
offer a potentially useful framework for theories of situational selection. 

This kind of theoretical development could help clarify some aspects of 
offender motivation, for if motivation involves a willingness to commit crime, 
then it is surely contingent upon opportunities. People are normally unwilling 
to act without taking at least a cursory glance at the circumstances in which 
they find themselves. Thus, we argue that part of motivation is a plan of 
action involving a specific configuration of situational variables (cf 
Brantingham & Brantingham 1984). For example, the motivation of burglars 
may lie 1n their willingness to secretly force entry into unoccupied homes and 
remove items for resale. If burglars perceive that one of these situational 
characteristics is not present (for example, someone is at home), then their 
motivation may diminish. It is therefore difficult to study crime opportunities 
without also examining offender motivation. 

Far more difficult has been the attempt to discover the causes of such plans 
of action. With what frequency do they appear, and why? Although 
experimental research has identified many variables that can motivate crime, 
its utility has so far been limited. First, experimental studies have only 
examined the link between selected situational variables and behavior. They 
pay little attention to the manner in which situational variables affect plans 
for action. Second, plans for action may be formed in one situation but not 
implemented until another, more appropriate, situation 1s encountered. This 
possibility is lost in most experimental research, which focuses on stimuli and 
outcomes in one situation. Third, non-situational phenomena, such as thoughts 
unconnected with the current situation, may motivate crime. Thus, although 
we cast part of motivation as a strategy for situated action, motivation is not 
always situationally induced. The situational perspective cannot therefore 
provide a comprehensive study of motivation, but it can encourage the study 
of criminality in terms more congruent with a vision of offenders as situated 
actors (cf Miethe & Meier 1990). 
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We began this review with the distinction between crime and criminality. 
Dissatisfaction with theories of criminality springs largely from their poten- 
tially weak link with the number of criminal acts committed. The situational 
analysis of crime cannot entirely avoid the same problem, because the 
explanation of criminal acts demands that we focus on both the sources of 
motivations and opportunities. Moreover, the difficulties of tracking the 
situational experiences of offenders and nonoffenders suggest that situational 
explanations of crime are more likely to be successful when formulated and 
tested with experimental methods. The merit of situational studies is to 
underscore the “factorial complexity” (Hirschi & Gottfredson 1986:58) of real 
counts of criminal acts and to focus greater attention on offender decision- 
making. 
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Abstract 


We review the theoretical models and the research on self-esteem among 
Hispanic and Asian American subgroups and compare these findings to the 
existing literature on African American self-image. Group self-esteem refers 
to how the individual feels about racial or ethnic group membership. Personal 
self-esteem refers to how the individual feels about the self in a comprehensive 
manner. We describe the major paradigms of ethnic/racial and personal 
self-esteem utilized in studies of Hispanics and Asian Americans. These 
paradigms are largely informed by the literature on ethnicity and stress the 
macrostructural forces that affect self-concept. Paradigms of African Ameri- 
can self-image, however, tend to focus more on the psychological mechanisms 
that transform social context into personal identity. We also review empirical 
evidence on both dimensions of self-esteem among Hispanics and Asian 
Americans, and we contrast these findings to research on African Americans. 
We conclude by suggesting parallels between the theories dealing with 
ethnicity and those dealing with race, and we suggest areas for further 
theoretical integration and empirical research. 


Racial and personal self-esteem have been extensively investigated for 
African American populations (Porter & Washington 1979, 1989). There is, 
however, no synthesis of the literature on self-esteem among American 
Hispanic and Asian subgroups, nor have these findings been compared to the 
literature on African American self-image. We review the theory and research 
that have either direct or indirect implications for Hispanic and Asian 
American self-esteem. Although the literature on these topics is growing, the 
discussion of Hispanic and Asian American self-esteem tends to be informed 
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by theoretical models that view self-esteem as a derivative of group processes, 
without directly testing these relationships. We briefly compare the Hispanic 
and Asian American literature to the work on African American self-concept 
and conclude by suggesting ways that these literatures can inform one another 
both theoretically and empirically. 

Though the ethnic and race literatures have some convergences, they 
employ largely different theoretical paradigms in the area of psychological 
identity. An ethnic group is a subgroup whose members are perceived by 
themselves and others to have a common origin and culture, and shared 
activities in which the common origin or culture is an essential ingredient 
(Yinger 1985). While ethnicity is constructed primarily around social and 
cultural characteristics, race is based primarily on physical characteristics. 
Hispanic and Asian American are viewed both as ethnic and racial categories; 
in the case of Hispanics, American black/white racial distinctions are often 
imposed on them by others. 

Self-esteem refers to how one evaluates the self. For purposes of analysis, 
we divide self-esteem into two components: group and personal. Group 
self-esteem refers to how the individual feels about his or her ethnic or racial 
group membership. By personal self-esteem we mean esteem for one’s 
individuality, regardless of racial or ethnic group; how one feels about the 
self in a comprehensive manner (Porter & Washington 1979, 1989). Although 
the research on African American identity has shifted in the last decade from 
an investigation of racial to an investigation of personal self-esteem, most of 
the discussion of Hispanic and Asian American identity focuses on ethnic 
group rather than personal self-esteem. 


CATEGORIZATION OF HISPANIC AND ASIAN 
AMERICAN SUBGROUPS 


The global categories “Hispanic” and “Asian” are socially constructed 
categories. The use of global terms to encompass disparate groups is of limited 
utility because of the demographic and cultural differences among the various 
Asian American and Hispanic subgroups, including differences in immigration 
history and mode of incorporation into American society. Among Hispanics, 
Mexican-Americans are the most heterogeneous subgroup in terms of socio- 
economic characteristics and generational composition. Puerto Ricans have 
the lowest labor force participation rate, highest unemployment level, highest 
incidence of poverty, and lowest level of education among the Hispanic 
subgroups. Cubans have the highest median income, the lowest unemploy- 
ment, and the lowest incidence of poverty compared to Mexican Americans 
and Puerto Ricans. These Hispanic populations also differ in geographic 
distribution, with Mexican Americans concentrated in the west and southwest, 
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Puerto Ricans in the Northeast, and Cubans in Florida (Bean & Tienda 1987, 
Maldonado 1991). Among Asian Americans, Koreans and Indochinese have 
the highest proportion of foreign-born; Vietnamese and other Indochinese have 
the lowest level of education and income and the highest rate of poverty (Vega 
& Rumbaut 1991). 

At present, national loyalties are too deeply internalized and the social and 
cultural experiences and history of each nationality are too divergent for these 
global terms to be the basis of identity and group self-image. Mexican 
Americans, Cubans, and Puerto Ricans may be aware that they share a 
common language and cultural roots, but this fact does not suffice to produce 
strong intergroup ethnic solidarity. Among Asians, the gap between global 
ethnic labels and reality is even greater because these groups do not share a 
common language (Bean & Tienda 1987, Portes & Rumbaut 1990, Aguirre 
& Saenz 1991, Maldonado 1991). The terms “Hispanic” and “Asian Ameri- 
can" are related more to government labels and to the perceptions of 
Anglo-American society than they are to common experiences or identity 
among the individuals so designated (Portes & Truelove 1987). These global 
labels are essentially political rather than cultural ethnic labels. They may 
serve the function of mobilizing individuals in diverse groups to solve their 
common problems and to gain access to governmental programs earmarked 
for disadvantaged racial and ethnic groups. Thus both a national origin and 
a pan-ethnic identity are possible. Which is most salient is determined by 
situational and contextual factors, but currently the pan-ethnic identity 
probably has less personal relevance for self-esteem than does the subgroup 
identity (Padilla 1985, Bean & Tienda 1987, Kendis 1989, Murguia 1991). 
This may change in the next several decades. Research on identity and 
self-esteem focuses on particular nationality subgroups, not on Asian Ameri- 
cans or Hispanics as a whole; in fact, these subgroups are typically studied 
in isolation from one another. 

Hispanic and Asian subgroups differ from white ethnics and African 
Americans. Unlike white ethnics, Hispanics and Asians have been the objects 
of racial discrimination. This racial discrimination accounts in part for their 
retention of an ethnic identity despite their long years of settlement in the 
United States and loss of ethnic language by some members of the group 
(Arce 1981, Bean & Tienda 1987, Rodriguez 1989). Unlike African Ameri- 
cans, many members of these groups voluntarily immigrated to the United 
States, utilized ethnic languages, and though subjected to economic hardships, 
lacked the experience of enslavement. Also, circular migration between the 
United States and the country of ongin for some subgroups and the recent 
arrival of new streams of immigration for others further differentiate Hispanics 
and Asians from white ethnics and African Americans (Bean & Tienda 1987, 
Portes & Truelove 1987). 
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We concentrate in this review on the subgroups most frequently studied in 
terms of self-esteem: Mexican Americans, Puerto Ricans, and Cubans among 
Hispanics and Chinese, Japanese, and Korean Americans among Asians. Of 
Asian-origin groups in the United States, the 1990 census indicates that 
approximately 2396 are of Chinese origin, 1296 are of Japanese origin, and 
11% are of Korean origin, as well as 19% Filipino, 8% Vietnamese, 11% 
Asian Indian, and 16% Hawaiian, Pacific Islander or “other Asian.” Among 
mainland Hispanic groups, approximately 6096 are of Mexican origin, 1296 
are Puerto Rican, 5% are Cuban, and 23% are "other Hispanic" (World 
Almanac 1991). 


GROUP IMAGE: ETHNIC/RACIAL SELF-ESTEEM 


Hispanics and Asian Americans: Ethnicity Paradigms of 
Group Image 


Literature on self-esteem among Hispanics and Asian Americans has tended 
to focus on the group-image component, or how the individual feels about 
his or her group identity. We summarize major cultural and structural 
paradigms of ethnicity that have been used to analyze group identity among 
Hispanics and Asian Americans, and thereafter, we briefly contrast these 
paradigms to the racial paradigms used to explain group identity among 
African Americans. 


ACCULTURATION/ASSIMILATION PARADIGM OF GROUP IMAGE Among His- 
panics and Ásian Americans, the acculturation and assimilation paradigms 
have been among the most prevalent paradigms informing the literature on 
ethnicity and group self-image. The most widely used model is Milton Gor- 
don's (1964) paradigm, which views assimilation as being composed of seven 
dimensions including cultural, structural, and identificational assimilation as 
well as several other dimensions. Another widely used conceptualization is 
Keefe & Padilla's (1987) division of acculturation into cultural awareness and 
ethnic loyalty, which differ from structural measures of incorporation. 
Three basic approaches to acculturation/assimilation and their explicit or 
implicit effects on ethnic self-esteem have been used most often to study 
Hispanic and Asian American subgroups. The first is the "single continuum 
model," in which ethnic and mainstream cultures are treated as mutually 
exclusive. In this paradigm, acculturation refers to the loss of traditional traits 
and acceptance of new cultural traits. Assimilation refers to economic, 
political, and social integration of the ethnic group into the mainstream 
society. Moreover, this model assumes that acculturation precedes assimila- 
tion (Yinger 1985). As alternate norms and associations are acquired, 
traditional ones are relinquished (Martinez & Mendoza 1984, Keefe & Padilla 
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1987). Ethnic identification is viewed as a separate dimension from accultur- 
ation and structural assimilation: it is the feeling about group membership and 
identity with the group. Studies of the Mexican American, Chinese American, 
and Japanese American communities have used this paradigm (for instance, 
Kitano 1969, Grebler et al 1970, Montero 1981, Shih-Shan 1986). This single 
continuum model implies that unilinear assimilation and acculturation will 
result in a loss or weakening of ethnic identity. It assumes that the more 
acculturated and assimilated the individuals, the less positive their responses 
will be to the group image component of self-esteem. 

A second model of assimilation and acculturation is the bi-cultural or 
two-dimensional model. This model regards identity with the host culture and 
the culture of origin as separate dimensions; each culture exists along a single 
continuum and individuals may vary not only in their adherence to the two 
cultures but also in their social associations within the two groups. The 
individual can accommodate to the host culture or social structure and still 
retain the culture or associations of the original ethnic group (Padilla 1980, 
Szapocznik & Kurtines 1980, Martinez & Mendoza 1984, Keefe & Padilla 
1987, Rodriguez 1989). This model, however, has been replaced recently by 
a third model, the multidimensional or pluralistic model of acculturation and 
assimilation, in which acculturation and assimilation are treated as complex, 
multifaceted phenomena. The acceptance of new cultural traits or social 
associations and retention of traditional cultural traits and social associations 
are viewed as varying from trait to trait, with the result that assimilation and 
acculturation are regarded as complex and situationally dictated processes. 
For instance, an individual may speak the ethnic language at home and English 
at work, or associate with Anglo Americans in the work situation but members 
of the ethnic group at home. There is some suggestion that these patterns may 
form a new cultural variant rather than an amalgam, but this position is not 
clearly articulated (Levine & Montero 1973, Martinez & Mendoza 1984, Hurh 
& Kim 1984, Keefe & Padilla 1987, Murguia 1989). In both the bicultural 
and the multidimensional or pluralistic perspectives, ethnic pride and identi- 
fication can persist even when awareness of traditional culture weakens (Keefe 
& Padila 1987), due to factors like the importance of ethnic voluntary 
organizations and for some groups, an influx of new immigrants (Hirschman 
& Wong 1981, Romo & Romo 1985, Massey et al 1987, Padilla 1987, Fugita 
& O’Brien 1991) Typologies combining the dimensions of acculturation and 
ethnic identification have been proposed by several investigators (Padilla 
1980, Martinez & Mendoza 1984, Keefe & Padilla 1987). 

Although the assimilation/acculturation model has become a popular model 
for explaining the group-image component of self-esteem among Hispanic 
and Asian Ámerican subgroups, 1t has several shortcomings. Acculturation, 
which denotes changing values, is often measured by structural variables like 
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family form, intermarriage, or social association rather than by components 
of culture. Also, the implications of these models for group image are poorly 
articulated; they fail to explain the psychological processes that maintain 
ethnic loyalty or group pride. Finally, they fail to address adequately the social 
context of acculturation, assimilation, and group identity. Though the 
paradigms focus on culture or social associations, they do not articulate 
sufficiently the different macro-social contexts experienced by the various 
subgroups and the possible effects of these contexts on the group image 
component of self-esteem. 


INTERNAL COLONIALISM PARADIGM OF GROUP IMAGE The internal colonial- 
ism perspective addresses the effect of macrostructural forces on group image 
specifically. Stimulated by Memmi’s (1965) and Fanon’s (1968) writings on 
the effects of European colonialism, the model applies the core/periphery 
analysis of world systems theory to groups within a single country. It suggests 
that economic imperialism is reinforced by cultural imperialism, with the 
result that colonized groups who are relegated to low status and low reward 
occupations develop a negative group image. Because of the convergence of 
shared experiences and class interests, internal colonialism eventually pro- 
duces ethnic solidarity. These movements occur primarily among the second 
or third generation (Olzak 1983, Vigil 1984, Murguia 1989). Blauner (1972) 
has applied this colonial perspective to black/white relations in the United 
States. However, this model, as evidenced in the literature, has been used far 
more frequently to explain group image and identity of some Hispanic 
subgroups than of Asian subgroups or of Afncan Americans. The internal 
colonialism model was widely utilized to analyze Chicano nationalism ın the 
1960s (Moore 1972, Barrera 1979) when ethnic culture and pride were used 
as symbolic forces for promoting social change. The stress on both the Spanish 
language and ethnic identification labels (Chicano rather than Mexican 
American) emerged as pervasive signs of altered group consciousness and as 
symbols for mobilizing a decolonization movement (Vigil 1984, Blea 1988, 
Gomez-Quinones 1990, Murguia 1991). Internal colonialism was also used 
to explain a resurgence of ethnic militancy among Puerto Ricans in the late 
1960s (Maldonado-Denis 1976, Padilla 1987). 

This paradigm is applied primarily to groups like Mexican Americans and 
Puerto Ricans who have experienced extensive labor market discrimination 
and high rates of poverty. Murguia (1989) has suggested that a decolonization 
movement occurs among subgroups whose main mode of entry is following 
conquest, whose population is large in size, who are numerically predominant 
in certain geographical areas, and whose race, culture, and religion diverge 
from the dominant society. This paradigm is 1napplicable to Cubans because 
of their mode of entry as political refugees and the protection afforded by an 
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ethnic enclave economy (Acosta-Belen 1988). Nor has it been a salient 
paradigm in addressing identity movements among Chinese, Japanese, and 
Korean American subgroups. 

Despite its appeal to Chicano and Puerto Rican groups, this theoretical 
model has a number of problems as an explanation of the rise of movements 
promoting ethnic self-esteem. It does not take into account the mode of entry 
or labor market incorporation of various ethnic groups. It views class interests 
as being expressed in ethnic terms; however, factors like circular migration, 
location in segmented job sectors, and residential segregation may sustain 
ethnic community, culture, and pride among those who are not politically 
militant (Rodriguez 1989). Also, the paradigm fails to explain why the 
Chicano and Puerto Rican identity movements of the 1960s were not sustained 
over time (Munoz 1989). 


ETHNIC COMPETITION PARADIGM OF GROUP IMAGE A recent model explain- 
ing the maintenance of ethnic identity and self-esteem is the ethnic resilience 
or ethnic competition model. Portes (1984) suggests that it is contact and 
competition with outside groups rather than confinement in one's own 
community that leads to ethnic awareness and mobilization. A heightened 
sense of ethnic solidarity occurs among minority immigrant groups who have 
achieved considerable socioeconomic assimilation but whose pathway to total 
acceptance and equality remains blocked (Olzak 1983, Portes & Rumbaut 
1990). Either explicitly or implicitly, this model is used to explain the 
continuation or resurgence of ethnic identity and self-esteem among upwardly 
mobile Cubans (Rogg & Cooney 1980, Portes & Bach 1985); Mexican 
Americans (Hurtado & Gurin 1987), Puerto Ricans (Rogler 1972, Safa 1988); 
Japanese Americans (Feagin & Fuyitaki 1972, Takaki 1989); and Chinese 
Americans (Yee 1973, Fong 1973, Wong 1982). 

The ethnic competition model has both similanties and differences from 
the internal colonialism model in explaining group self-esteem. Both models 
view ethnic solidarity and by implication group esteem as a reaction to 
discrimination that shapes the identity of the second generation of various 
ethnic groups. But the two models differ in their perspective on the class basis 
of ethnic pride; the ethnic competition paradigm views ethnic mobilization as 
being greater in a socially mobile group, while the internal colonialism 
paradigm views it as being greater in lower class groups. Also, the internal 
colonialism model is linked more directly to class conflict and exploitation 
and the ethnic competition paradigm to economic competition and discrimi- 
nation. Neither paradigm addresses ethnic self-esteem as a phenomenon that 
can be generated by ethnic enclosure or recency of immigration (Keefe & 
Padilla 1987). Moreover, neither considers the psychological mechanisms 
through which group mobilization is transformed into individual self-esteem; 
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rather, they emphasize ethnic mobilization while regarding individual self-es- 
teem as merely derivative of group process. Sources of ethnic self-esteem 
within any ethnic subgroup may differ; for example, both first generation 
immigrants and residentially or economically isolated subgroups may derive 
a positive group image from their lack of assimilation. Overall, the ethnic 
competition model might best explain ethnic self-esteem among the upwardly 
mobile, while the internal colonialism model may more effectively explain 
ethnic self-esteem among young, second generation, low income populations. 
In any case, the implications of both the ethnic competition paradigm and the 
internal colonialism paradigm for the group-image component of self-esteem 
need to be studied empirically by measuring dimensions like group image, 
assimilation and acculturation on a variety of traits, perception of prejudice 
and discrimination, and alienation from Anglo American society. Both interna] 
colonialism and ethnic competition presuppose multicultural assimilation; 
however, they do not distinguish the different dimensions of either accultur- 
ation or assimilation, and they fail to explain the way these dimensions affect 
group mobilization and identification. 


LABOR MARKET INCORPORATION PARADIGM OF GROUP IMAGE Another prev- 
alent paradigm utilizes economic theories that explain the persistence of ethnic 
boundaries by stressing the role of ethnic economic institutions in maintaining 
group identity. These paradigms imply that ethnic solidarity created by 
economic relations will have a positive impact on the group image dimension 
of self concept. Also, they assume that segregation into ethnically controlled 
occupations impedes assimilation to the larger society and creates ethnic 
solidarity (Olzak 1983). 

This paradigm is exemplified by the growing literature on ethnic economic 
enclaves, or areas of concentrated immigrant entrepreneurship. In these 
immigrant enterprises, group members do not have to go beyond the physical 
and social limits of the ethnic enclave to carry out their routine activities. 
Simply put, economic mobility does not entail cultural integration, since the 
enclave develops an array of institutions to preserve and defend cultural 
identity against external pressure (Portes & Bach 1985, Bean & Tienda 1987, 
Safa 1988). This causal relationship among ethnic organizations, language 
retention, and ethnic solidarity in an ethnic economic enclave may be 
enhanced by the degree of discrimination minority group members encounter 
in the larger society (Olzak 1983). According to this paradigm, the group 
image component of self-esteem will be positive for workers located in an 
ethnic enclave economy. Ethnic enclave economies of this type characterize 
the labor market incorporation of Cubans (Portes & Bach 1985, Bean & Tienda 
1987), and of Chinese, Japanese, and some groups of Koreans (Wong 1985, 
Light & Bonacich 1988, Portes & Rumbaut 1990). 
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Tightly integrated ethnic economic enclaves are not, however, the only 
domains of ethnic entrepreneurship. For example, where the concentration of 
an ethnic group is less dense, ethnic entrepreneurs may take over businesses 
catering to low income groups in inner cities (Portes & Rumbaut 1990). This 
“middleman minority” business niche may also reinforce ethnic solidarity and 
ethnic self-esteem, because these businesses tend to employ fellow ethnics 
and often depend on ethnic structures such as credit associations (Light 1972, 
Bonacich & Modell 1980) and values such as ethnic honor (Light 1972). 
Middleman minorities tend to be recent immigrants. The relationship between 
“middleman minority” enterprises and ethnic solidarity has been reported for 
Chinese Americans (Light 1972, Takaki 1989); Japanese Americans (Light 
1972; Bonacich & Modell 1980; Takaki 1989); and Koreans outside of Los 
Angeles (Takaki 1989). However, this paradigm has been criticized as having 
more applicability to Korean than to Chinese or Japanese American enterprises 
(Wong 1985). 

Although incorporation into ethnic enterprise has implications for ethnic 
solidarity and positive self-concept, the effect of economic incorporation into 
secondary labor markets on ethnic self-esteem is less clear. Groups that enter 
secondary labor markets in economies that are in structural decline are 
economically vulnerable and experience high rates of poverty and unemploy- 
ment and other deprivations. This pattern is evident among Puerto Ricans 1n 
the northeast (Portes & Truelove 1987, Rodriguez & Melendez 1992) and 
among some Mexican American subgroups (Grebler et al 1970). There is, 
however, little research on the effect of such forms of labor force incorporation 
on feelings about group image. 

The form of labor market incorporation 1s a useful perspective for analyzing 
variation in the group image component of self-esteem; however, discussions 
based on such paradigms have problems when they are extrapolated to explain 
individual-level variables. Individuals may vary in the group image dimension 
of self-esteem even when their subgroup is characterized by a bigh degree of 
economic and ethnic solidarity. These discussions also fail to explain why 
ethnic loyalty persists even when economic solidarity evaporates (Keefe & 
Padilla 1987). That is because the effect of ethnic loyalty on self-esteem is 
inferred rather than empirically demonstrated. 


SUMMARY.ETHNIC PARADIGMS:HISPANIC AND ASIAN GROUP IMAGE The ac- 
culturation/assimilation paradigms and the explicitly structural models like 
internal colonialism, ethnic competition, and labor market incorporation 
provide important insights into processes that form the group image component 
of self-concept. They explicitly address the cultural and structural forces that 
determine how the group views itself and is viewed by others. All of these 
ethnic paradigms also have limitations, however. They do not clearly 
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distinguish ethnic mobilization, ethnic solidarity, and ethnic self-esteem, nor 
do they explain the psychological mechanisms that link personal identity to 
culture and social structure. In short, they fail to explain why individuals 
exposed to the same social forces within a given ethnic group vary in their 
identification with and evaluation of the group. To some extent, this is also 
true of the ethnic assimilation/acculturation paradigms which address them- 
selves to ethnic solidarity but do not provide general psychological models 
that explain how group identity is transformed into individual identity. The 
effect of the explicitly structural factors on personal identity has not been 
thoroughly tested, nor has there been an attempt to combine the structural and 
assimilation/acculturation paradigms into a more general model predicting 
self-esteem. 


African Americans: Racial Paradigms of Group Image 


The ethnicity paradigms described above are the major models that have been 
used to explain the group image component of self-esteem among Asian 
Americans and Hispanics. Although there is an extensive sociological 
literature on African Americans and a growing sociological literature on 
self-esteem, the specific body of literature on African American group image 
has been informed largely by models from the field of psychology. 

Although early research on African American self-esteem assumed that the 
racial self-esteem of African Americans was invariably damaged, recent 
studies have been guided by three major social-psychological paradigms; 
relative deprivation, alienation, and the role of subculture. 

Relative deprivation theory was a popular paradigm during the 1960s and 
1970s, proposed mainly by social psychologists. This theory assumed that 
rapid socioeconomic change led to nsing expectations and a shift in reference 
groups. Blacks, comparing their group with whites, felt relatively deprived 
of prestige and power and consequently became dissatisfied, racially militant, 
and as a result developed high racial self-esteem (Porter & Washington 1979, 
1989). 

Another theoretical paradigm of the rise in racial self-esteem focuses on 
socialization into a changing subculture. According to this paradigm, when 
black consciousness began to be stressed in the late 1960s, individuals became 
socialized directly into in-group identification and black pride; this was 
reinforced by emerging institutions within the black community. Especially 
among black social scientists, the black consciousness movement is viewed 
not as a new phenomenon but as building on existing cultural and structural 
strengths within the black community (Jones 1980, Gwaltney 1980). For 
instance, this subcultural paradigm is the central theme of the Journal of Black 
Psychology, a major source of research on the group image component of 
black self-esteem. Racial self-esteem is treated by some psychological 
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theorists as a developmental process progressing through stages of identifica- 
tion with black community and culture, culminating in the linkage of the black 
experience and universal values (Porter & Washington 1979, 1989). 

The alienation paradigm, popular among sociologists and social psycholo- 
gists in the late 1960s and early 1970s, is the paradigm of racial self-esteem 
most closely linked to sociological theory. It proposed that because of feelings 
of powerlessness and isolation, alienation among black lower-income groups 
would lead to attraction to black identity movements (Porter & Washington 
1979, 1989). 

Apart from theoretical considerations, empirical research supports the 
existence of black identification and high racial self-esteem for most African 
Americans, although literature on very young children reveals mixed findings. 
Studies of adolescents and adults indicate a clear shift over time toward more 
positive racial self-esteem. High racial self-esteem is reported among middle- 
class groups, but the findings on lower-income groups are inconsistent (Porter 
& Washington 1979, 1989). 


Comparison of Ethnic and Race Paradigms of Group Image 


Convergences exist between the paradigms used to explain ethnic and racial 
self-esteem. The social psychological mechanisms in the race paradigms are 
the explanations that link the cultural and structural forces described in the 
ethnicity literature to variations in individual ethnic identity. Relative depn- 
vation is the psychological process that explains the relation between ethnic 
competition and self-esteem. Alienation is the psychological underpinning of 
the internal colonialism model. The assimilation/acculturation models are 
explained in part by the “effect of subculture” paradigm; the developmental 
models of a more integrated racial and universal identity are particularly 
appropriate for theories that deal with multidimensional assimilation and 
acculturation. The psychological effect on self-esteem of incorporation into 
ethnic labor markets is also explained in part by the effect of socialization 
into a subculture with its supporting institutional structure. 

Major differences also exist between the race and ethnic literature on the 
group image component of self-esteem. The ethnicity paradigms emphasize 
cultural and structural correlates of group image. For example, factors such 
as the role of migration, labor market incorporation, the difference between 
assimilation and ethnic identification, and the multidimensional nature of 
assimilation and acculturation are basically ignored in the race paradigms. 
Also, in much of the racial self-esteem literature, African Americans are 
viewed as being acculturated into American society. Aside from a small group 
composed primarily of black psychologists and anthropologists, few people 
view black culture as a separate, alternate form of cultural experience, as is 
ethnic culture. But, on the other hand, most of the studies of racial self-esteem 
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stress the social psychological process of identity transmission, whether it be 
socialization, relative deprivation, or alienation—phenomena the ethnicity 
literature does not directly address. 


Research Findings on Group Image: Hispanics and Asian 
Americans 


There is no standard research instrument to measure ethnic self-esteem among 
either Hispanic or Asian American subgroups. Ethnic identity has been 
measured by unidimensional behavioral scales (Garcia & Lega 1979), 
language ability (Ortiz & Arce 1984), and generation (Kitano 1969), or 
respondents have been asked for their self-identity (Garcia 1982). Others have 
developed scales based on semantic differential ratings and background 
information (Olmedo et al 1978). Several studies measure the difference 
between acculturation and ethnic identification (Masuda et al 1970, Clark et 
al 1976, Connor 1977, Szapocznik & Kurtines 1980, Keefe & Padilla 1987, 
Montgomery 1992). Among young children, a variety of picture and line 
drawing techniques are utilized. 

Some studies of children report that Mexican American children show 
higher group identification and preference than whites or blacks (Rice et al 
1974, Goebel & Cole 1975, Rohrer 1977, Levine & Ruiz 1978), but some 
report lower ethnic/racial self-esteem (Weiland & Coughlin 1979, Teplin 
1976, Iadicola 1981, Newman et al 1983). Research on Chinese American 
children finds strong ethnic identity (Sung 1987), but other research reports 
that Chinese American children show less racial self-esteem than African 
Americans or whites (Fox & Jordan 1973). In a study of high school students, 
Bowler et al (1986) indicate that African Americans have a more positive 
group image dimension of self-esteem than do Hispanics, but Hispanics have 
more positive self-esteem than Asians. It is difficult, however, to interpret 
these studies. The samples are often small, there is little control for ethnic 
context, and social class and region are rarely controlled and sometimes not 
even specified. 

Some studies of adults indicate support for the unidimensional assimila- 
tion/acculturation model (Fong 1973, Montero 1981, Levine & Rhodes 1981). 
The multidimensional assimilation/acculturation or pluralist paradigm has 
received more support. Particularly notable has been the persistence of a high 
degree of ethnic identification and/or association even when awareness of 
elements of ethnic culture has declined; this result has been noted among 
Japanese Americans (Clark et al 1976, Woodrum 1981, Kendis 1989, Fugita 
& O’Brien 1991), Chinese Americans (Yao 1983), and Mexican Americans 
(Clark et al 1976, Garcia 1982, Keefe & Padilla 1987). Chinese American 
students have more positive ethnic identities than do Japanese American 
students (Sue & Kirk 1973), and Cuban Americans have demonstrated more 
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positive ethnic identities than Mexican Americans (Portes & Bach 1985), 
offering some indirect support for the effect of ethnic economies on group 
image. 

Although cultural awareness declines over generations, ethnic loyalty may 
persist among both Asian Americans (Masuda et al 1970, Connor 1977) and 
Hispanics, although ethnic identity among succeeding generations is some- 
what weaker than in the immigrant generation (Padilla 1980, Keefe & Padilla 
1987). Birthplace is also associated with ethnic self-esteem; the foreign born 
have a better group image than the native born (Buriel & Vasquez 1982, 
Martinez & Mendoza 1984, Rodriguez 1989). Lower socioeconomic status 
and barrio residence are related to more positive ethnic esteem among Mexican 
Americans (Keefe & Padilla 1987). However, upwardly mobile Cubans 
(Portes & Bach 1985) and Japanese Americans (Feagin & Fujitaki 1972) are 
more aware of discrimination, and ethnic identity is strong. City of residence 
may also affect ethnic self-esteem; individuals who live in cities where their 
ethnic group is less segregated are more likely to develop an ethnopolitical 
identity, and those who live in more ethnically segregated cities are more 
likely to retain a traditional, nonpolitical identity with fewer social class 
differences in group image (Grebler et al 1970, Sanchez-Jankowski 1986). 
Lower class Mexican American youths who live in less ethnically segmented 
cities, however, are more alienated from the host society (Sanchez-Jankowski 
1986). These findings support variants of the internal colonialism, assimila- 
tion/acculturation, and ethnic competition models. 

Hispanics, Asian Americans, and African Americans show a high degree 
of identification with the group and positive evaluation of the self as a member 
of the group. All groups, especially the young and socially mobile, have 
moved toward a positive ethnopolitical identity. The literature on sources of 
variation in group identity is not consistent for any group, though research 
suggests that middle class African Americans have the most positive group 
identity among blacks. The findings on social class and group identity for 
Hispanics and Asians, however, have been less clear. 

The literature on the group image component of self-esteem for Hispanics 
and Asian Americans does not disaggregate the effects of a consonant 
structural and a consonant subcultural context; for instance, the effect of 
economic incorporation in an enclave is not disaggregated from lack of 
acculturation. Group interaction is sometimes used to measure identity, which 
is a psychological construct. Ethnopolitical identity is often not clearly 
distinguished from a traditional ethnic identity. Both may be positive, but 
they are responses to different influences. The effects of migration, sojourner 
status, and age also need to be distinguished more clearly in the empirical 
literature, and more systematic controls for social class are necessary. These 
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ethnic paradigms have implications for the group image component of 
self-esteem, but the implicit connections have not been clearly specified either 
theoretically or empirically. The psychological mechanisms linking structural 
and cultural factors to self-esteem need more careful articulation. Research 
on Hispanic and Asian American populations needs to be directed specifically 
to the investigation of correlates of self-esteem rather than inferring self-es- 
teem as a derivative of group processes. 


PERSONAL SELF-ESTEEM 


Distinct from feelings about being a member of a racial or ethnic group, an 
individual acquires a general evaluative view of the self. We call this personal 
self-esteem which, if high, comprises feelings of intrinsic worth, competence, 
and self-approval rather than self-rejection and self-contempt (Porter & 
Washington 1979). Although a large body of literature relates ethnicity and 
mental illness (Vega & Rumbaut 1991), few studies among Hispanics and 
Asian Americans deal with general personal self-esteem among those who are 
not mentally ill. Among African Americans, however, personal self-esteem 
has replaced racial self-esteem as the major focus of research on black 
personality. Because this area has received far less attention than ethnic 
self-esteem among Hispanics and Asian Americans, we briefly summarize the 
three basic paradigms of personal self-esteem utilized for Hispanic and Asian 
American populations and then compare them to the three major paradigms 
used to explain personal self-esteem among blacks. 


Paradigms of Personal Self-Esteem: Hispanics and Asian 
Americans. 


CULTURE CONFLICT, MARGINALITY, AND PERSONAL SELF-ESTEEM This widely 
used paradigm of personal self-esteem among Hispanics and Asian Americans 
assumes a unilinear assimilation model; ethnic and host culture are viewed as 
incompatible. In short, it suggests that individuals cannot tolerate cultural 
marginality which leads to psychological distress and a poor personal self 
image. As the individual assimilates and acculturates, there is less conflict 
and greater psychological health (Martinez & Mendozo 1984). Alternate 
models are the bicultural and the pluralism models, which hypothesize that 
ethnic/racial groups can never fully assimilate; thus, individuals aligned with 
traditional culture do not as readily succumb to the effects of racism and are 
able to explore new cultural avenues. Alignment with both host and ethnic 
culture and institutions provides the individual with psychological flexibility 
and adaptability and good personal self-esteem (Sue & Wagner 1973, Martinez 
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& Mendoza 1984). Implicit in all of these models is the notion that group 
and personal identity are isomorphic; positive group esteem will lead to 
positive personal self-esteem. 


SOCIAL MOBILITY PARADIGM OF PERSONAL SELF ESTEEM According to this 
paradigm, which is frequently utilized to explain personal self-esteem among 
Hispanics and Asian Americans, social mobility and economic success will 
lead to feelings of competence, ability to cope with acculturative stress, and 
good personal self-esteem (Takaki 1989, Vega & Rumbaut 1991, Moyerman 
& Forman 1992). The "model minority" thesis is a variant of this approach: 
ethnic groups with values that stress hard work and achievement will be 
successful economically, and thus manifest a high personal self-esteem. This 
model minority paradigm is frequently applied to Asian American subgroups. 
However, the model minority thesis has been criticized for neglecting the 
existence of ethnic/racial discrimination, income inequality, inadequate fund- 
ing for social services, and socioeconomic heterogeneity within the relevant 
group (Wong 1985, Takaki 1989). 


FAMILY AND COMMUNITY STRUCTURE PARADIGM OF PERSONAL SELF-ESTEEM 
This paradigm explaining personal self-esteem among Hispanics and Asian 
Americans suggests that the individual may find personal support networks, 
economic opportunities, and social acceptance in ethnic communities (Vega 
& Rumbaut 1991). Kin networks are also important for mutual aid and 
support. Individuals who have social associations within ethnic communities 
or extended ethnic kin networks may experience more personal self-esteem 
(Keefe & Padilla 1987). 


Paradigms of Personal Self-Esteem: African Americans 


Much research has demonstrated that personal self-esteem among African 
Americans is not lower than it is among whites. Middle-class African 
Americans have higher personal self-esteem than lower income groups (Porter 
& Washington 1979, 1989). The three major paradigms used to explain these 
findings are theories of social evaluation, locus of control, and socialization 
and the strength of community. 


THE SOCIAL EVALUATION PARADIGM OF PERSONAL SELF-ESTEEM This theory 
maintains that human beings learn about themselves by comparing themselves 
to others. A consonant racial context will cause African Americans to utilize 
their own community for social comparisons and is protective of personal 
self-esteem. Alternatively, a dissonant racial context will cause poor personal 
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self-image (Porter & Washington 1979, 1989). Although developed by 
psychologists, this paradigm has also been utilized in sociological literature. 


THE LOCUS OF CONTROL PARADIGM OF PERSONAL SELF-ESTEEM ‘This para- 
digm suggests that internal control is a perception that reward is contingent 
on one’s own behavior; external control is the notion that rewards are 
controlled by external forces. This theory postulates that high external control, 
involving blaming the system of racial discrimination rather than the self, 
leads to high self-esteem for blacks (Porter & Washington 1979, 1989). 


THE SOCIALIZATION AND THE STRENGTH OF COMMUNITY PARADIGM OF PER- 
SONAL SELF-ESTEEM The lower class black community has been characterized 
by social and family disorganization with consequent negative effects on 
self-image; however, a contrasting paradigm views the black community not 
as a deficient or pathological environment but as an alternative culture, 
containing its own distinctive institutions and family forms. It suggests that 
individuals in this culture are actively socialized into a positive, supportive 
tradition. Although this model also exists in the sociological literature, the 
major proponents of this paradigm of personal self-esteem have been African 
American psychologists, who have criticized white psychologists for ignoring 
the black experience and interpreting it in the context of white norms and 
culture (Porter & Washington 1979, 1989). 


Comparison of Race and Ethnic Paradigms: Personal 
Self-Esteem 


The ethnic paradigms stress the role of culture and structure in personal 
self-esteem. In contrast the race paradigms, whose major proponents have 
been psychologists, emphasize psychological mechanisms like socialization, 
self evaluation, and locus of control. There are clear convergences, however. 
Social psychological mechanisms described in the literature on race are the 
mediators between the sociocultural factors discussed in the ethnic literature 
and personal self-esteem. The community socialization paradigm found in the 
race literature analyzes psychological mechanisms that help to explain the 
relationship between family and community structure and self-esteem, which 
is often highlighted in the ethnic literature. The psychological mechanism of 
social evaluation is assumed rather than specified in the ethnic assimilation 
theories. In contrast, the locus-of-control paradigm focuses on mechanisms 
that can explain the ethnic social mobility model without the “model minority” 
implications. Economic competition produces awareness of discrimination, 
with the result that social mobility is likely to lead to system rather than 
self-blame, higher self-esteem, and ethnic revitalization movements. 

Also, the race and ethnicity paradigms diverge in significant ways. The 


MINORITY SELF-ESTEEM 155 


supportive roles of ethnic community and family structure have been stressed 
for Hispanics and Asian Americans but have received less emphasis in the 
literature on blacks. In this literature on blacks, however, more attention is 
paid to personal self-esteem, and the underlying social psychological models 
are more clearly articulated, whereas in the ethnic literature, the effects of 
culture conflict and social mobility are more explicitly treated. Subgroup 
culture for Hispanics and Asians is seen as being different from the host 
culture rather than as being deviant or pathological, the way it is often 
characterized in the literature on African Americans. 


Research Findings on Personal Self-Esteem Among Hispanics 
and Asian Americans 


Personal self-esteem is most often measured by the Rosenberg Self-Esteem 
Scale or the Piers-Harris Children’s Self Concept Scale (Porter & Washington 
1989). Some studies of children indicate that Mexican American children 
have lower personal self-esteem than do Anglo Americans and/or black 
Americans (Knight et al 1978, Stephen & Rosenfeld 1978, Grossman et al 
1985). Other studies find no difference between black, Anglo, and Mexican 
American children (Samuels & Griffore 1979, Larned & Muller 1979, Franco 
1983). Several studies report that Koreans and Japanese Americans have a 
poor body image component of personal self-esteem compared to blacks or 
whites. In general, research on children indicates that Asian American children 
have lower personal self-esteem than do Anglos, blacks, and/or Mexican 
Americans (Chang 1975, Henkin & Nguyen 1984, Pang et al 1985, Bowler 
et al 1986). These findings are difficult to interpret because few of these 
studies control for the effects of class, region, or ethnic contact. 

Studies of adults indicate that biculturalism is related to psychological 
adjustment among Asian (Vega & Rumbaut 1991) and Hispanic (Szapocznik 
& Kurtines 1980, Phinney 1991) subgroups; however, personal self-esteem 
among Hispanic and Asian American adults has not been widely investigated. 


Relationship between Group and Personal Self-Esteem 


Most of the ethnic paradigms already discussed posit a reciprocal relationship 
between racial and personal self-esteem. Positive group image is viewed as 
protective of personal self-esteem, and high personal self-esteem leads to 
ability to cope with racial and ethnic discrimination more successfully. 
Research indicates that high personal self-esteem is related to good group 
image among Hispanic and Asian subgroups (Garcia 1982, Padilla 1987, 
Chavira & Phinney 1991, Phinney 1991). However, recent research on 
African American populations provides evidence that the two dimensions do 
not necessarily vary together (Porter & Washington 1989). Further research 
on this relationship is needed. 
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CONCLUSION 


Implications for Theory 


Hispanics, Asian Americans, and African Americans all have experienced a 
history of discrimination and negative stereotypes directed against them. 
Ethnicity is the salient defining characteristic of Hispanics and Asian 
Americans. Racial categories are imposed on them by the host society, but 
differences in language, a clearly defined subculture, and continuing migration 
streams as well as in skin color help perpetuate subgroup ethnic identification. 
Legal discrimination against African Americans has been longer and more 
severe than that experienced by other groups. With the erosion of African 
subgroup differences under slavery, skin color became the defining attribute 
of African Americans. Until recently, most literature on African Americans 
treated them as acculturated into American society and as having no viable 
subculture of their own, other than one characterized by pathology (Porter & 
Washington 1989). 

Empirical investigation of self-esteem has been far more extensive for 
African Americans than for Hispanic and Asian subgroups because of the 
severity of racial discrimination historically experienced by blacks in the 
United States. In addition, this research was stimulated by legal challenges 
seeking to demonstrate that low self-esteem was an effect of racial discrimi- 
nation. Much of the early research on African Americans was done by 
psychologists; since self-esteem is a major focus of psychological research, 
there is a tradition of direct empirical investigation of self-esteem among 
African Americans that exists to a lesser extent among Hispanics and Asian 
Americans. In the Asian American and Hispanic literature, self-esteem 1s 
often a derivative based on implicit assumptions of a connection between 
group or personal self-esteem and sociocultural forces. In short, many 
researchers fail to investigate this relationship directly. Recent paradigms 
(Keefe & Padilla 1987), however, begin to make a clear distinction between 
culture, structure, and psychological identification. 

Generally speaking, recent paradigms and findings on ethnic/racial groups 
have been characterized by emphasis on the rise of ethnic racial consciousness 
and its effects on identity for younger generations, the presence of group pride 
and high personal self-esteem in ethnic subgroups, and the effect of a 
supportive subcommunity in maintaining a positive group identity. They also 
focus on heterogeneity in self-esteem within any subgroup by factors such as 
socioeconomic status, age, and consonance of context. Because the number 
of black, Hispanic, and Asian American social scientists has increased and 
the interest in ethnic consciousness stimulated by the 1960s movements has 
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become professionally institutionalized, research on both racial and ethnic 
self-esteem has focused increasingly on the structure and internal variations 
of ethnic subcommunities. 

Empirical research has confirmed the importance of the variables high- 
lighted by all of the paradigms. First, cultural and structural pluralism, ethnic 
competition, and relative deprivation are linked to the rise of an ethnopolitical 
consciousness in all of these groups, especially among their middle classes. 
Second, the assimilation, ethnic economy, and subcultural paradigms are 
supported by evidence linking high ethnic/racial self-esteem to a consonant 
ethnic/racial context. Third, while the alienation and internal colonialism 
paradigms explain some of the recent findings on the group image of lower 
class young people, group identity does not conform to one model, but varies 
with time, place, and social location. In general, these findings suggest that 
a multidimensional model would be more appropriate for explaining the 
relationship between racial/ethnic identity and social context. The complex 
relationship of context, group self-image, and personal self-esteem warrants 
further investigation. 

Although the African American race literature and the Hispanic and Asian 
American literature on ethnicity draw upon largely different paradigms to 
explain self-esteem, an integration of these perspectives is possible. The 
social-psychological models of racial self-esteem can add a useful corrective 
to the ethnicity literature, which does not clearly articulate the psychological 
processes intervening between cultural and structural theories and group and 
personal self-image. Incorporating models of personal self-esteem from the 
literature on African Americans would shift the focus of the Hispanic and 
Asian literature to a variety of personality responses wider than the present 
focus on mental illness. Conversely, the acculturation/assimilation paradigms 
in the ethnicity literature fruitfully can be applied to African American 
personal and group image. A deficit model currently characterizes much of 
the social science treatment of African American culture; blacks are viewed 
as being either assimilated or oriented to an inferior replica of mainstream 
culture. Research on African Americans in the last decade stresses the positive 
role of black culture as the source of support and behavioral orientation that 
strengthens the identity and self-esteem of black Americans (Jones 1980, 
Gwaltney 1980), a conclusion consistent with both the social evaluation and 
the supportive subcommunity paradigms. Yet the pluralism paradigm of 
acculturation/assimilation can be usefully applied to research on African 
Americans. Because of its emphasis on cultural difference rather than cultural 
deficits and its use of a multidimensional contextual framework, this model 
could provide a more effective perspective for analyzing Airican American 
self-esteem by helping to explain why both group and personal image among 
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African Americans do not differ from those of whites (Porter & Washington 
1979). 


Implications for Research 


Research on Hispanic and Asian American self-esteem needs to address clearly 
the. difference between the racial and personal dimensions of self-image and 
to develop adequate measures of these constructs which are culturally 
appropriate. Research on Hispanic group image is currently beginning to 
address these issues. Studies of African American self-image need to be 
informed by sociocultural variables; the effects on self-esteem of social 
mobility and social contextual variables like form of labor market incorpora- 
tion need to be explicitly addressed. Personal self-esteem needs far more 
research among Hispanics and Asians. Also, attention must be paid in all 
groups to the situational nature of ethnicity, the salience and intensity of ethnic 
identity, and the reciprocal relationship between personal and group image. 
The effect of American racial categorization on Latino identity needs special 
investigation (Montalvo 1991, Rodriguez 1989). More research should be 
conducted on self-esteem among other Hispanic and Asian American sub- 
groups, and these models should be thoroughly tested on racial/ethnic groups 
outside the United States 

We have stressed theoretical paradigms of self-esteem in part because 
empirical research explicitly investigating Hispanic and Asian American 
self-esteem is still not as extensive as the literature on this topic for African 
Americans. Also, future research on personal and group self-esteem for a 
particular population can benefit from the awareness of models applied to 
other racial/ethnic subgroups. From the 1950s to the 1970s, research on the 
effect of social forces on self-esteem focused on African Americans. Much 
of the explicitly sociological literature on African Americans does not address 
the specific dimension of self-esteem. The paradigms informing research on 
African American self-esteem are largely psychological, undoubtedly reflect- 
ing both the origin and the long tradition of this particular body of research 
in the fields of psychology and social psychology. The extensive body of 
theory that grounds the discussion of Hispanic and Asian American self-es- 
teem, on the other hand, frequently treats self-esteem as derivative of 
sociocultural processes without directly testing these relationships. There is 
an important and growing empirical literature investigating self-esteem and 
its correlates among Hispanic populations. Yet there is a critical need for 
more empirical research on self-esteem that directly tests the implications of 
these theoretical paradigms, especially among Asian Americans where there 
is less research on self-esteem than there is among other subgroups. As the 
interest in Hispanic and Asian studies increases, investigations of self-esteem 
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among these groups can both benefit from and inform our understanding of 


African American self-concept. 
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Abstract 


Despite recent developments in “fiscal sociology” (i.e. the sociological 
analysis of taxation and public finances), few efforts have been made to 
synthesize scholarly developments in the field. This is particularly surprising 
because much of the literature on the determinants of taxation bears directly 
on the debates in political sociology about the social, political, institutional, 
and other determinants of government policy. This essay reviews and 
integrates the literature on tax policy formation and relates it to these general 
debates. It also explores some of the important but often neglected effects 
that taxation has on such phenomena as political revolution, state building, 
economic organization, labor force participation, and philanthropy. Thus, this 
essay demonstrates the significance and reviews recent developments in fiscal 
sociology. 


THE STATE AND FISCAL SOCIOLOGY 


Schumpeter (1954, pp. 6-7) observed that the extraction of tax revenue by 
the state has an enormous influence on economic organization, social 
structure, human spirit and culture, and the fate of nations. For this reason 
he argued that the study of the social processes behind taxation and public 
finances, that is, fiscal sociology, is one of the best starting points for an 
investigation of society, and particularly its political life. Although others 
have echoed his call for the development of fiscal sociology (Bell 1976, Jacobs 
1988, O'Connor 1973, Padgett 1981), until recently the field has remained 
in its infancy (Block 1981, p. 3, Swedberg 1991, p. 48). Moreover, the 
literature that has emerged is scattered across social science disciplines. This 
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164 CAMPBELL : 
essay begins to integrate this work, relates it to other areas of sociology, and 
indicates some directions for future research. 

Fiscal sociology differs from other approaches to the study of taxes and 
public finances because it considers how these things affect and are affected 
by a wide range of political, economic, cultural, institutional, and historical 
factors. For example, mainstream public finance economists assume that the 
state 1s a rational actor responding through fiscal policies to market failures 
in ways that are consistent with macroeconomic policy programs and 
neoclassical efficiency criteria. Furthermore, public finance economists are 
primarily concerned with how fiscal policy will affect revenue flows, income 
distributions, and economic performance (Helms 1983, p. 27, Karran 1985, 
p. 366). In contrast, although fiscal sociology 1s concerned with these issues, 
it also recognizes that a variety of non-economic causes and consequences 
are involved that are just as important as those discussed by the economists 
(Goldscheid 1958, Mann 1943, Schumpeter 1954). Similarly, whereas 
political scientists often attribute fiscal policy to political and bureaucratic 
inertia, fiscal sociology recognizes that there are many forces 1n society as 
well as within the state that can override the forces of inertia (Padgett 1981).' 
In other words, fiscal sociology differs from other approaches by focusing 
explicitly on the complex social interactions and institutional and historical 
contexts that link state and society in ways that shape fiscal policies and their 
effects. 

Although fiscal sociology explores how taxes and expenditures are deter- 
mined and affect society (Padgett 1981, p. 76), I focus here exclusively on 
taxes. First, political sociologists have paid much more attention to the 
determinants of expenditures than taxes (e.g. Quadagno 1987, Skocpol & 
Amenta 1986). The neglect of tax policy represents an important gap in our 
knowledge (Alt 1983, pp. 185-86) because taxes are the state's “source of 
life” (Marx 1969, p. 482) and because the determinants of tax and expenditure 
policies are not necessarily the same (Alt 1983, p. 186, Blackley 1990, 
Carruthers & Amenta 1991, Peters 1979, p. 790). Thus, studying the 
determinants of tax policy can provide new insights about the state and policy 
making. Second, although scholars have recognized how state expenditures 
affect the organization of economic activity (Lindberg & Campbell 1991), 
state building (Hooks 1990, Mann 1984), labor markets (Scharpf 1984), and 
other sociological phenomena, they are just beginning to recognize that tax 
policy also has significant effects. The fact that tax revenues increased from 


‘See Helms (1983) for a review of the public finance literature and Musgrave & Musgrave 
(1989) for a widely respected example of this tradition See Rose & Karran (1987) for examples 
of the political inertia tradition. 
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about 10% to 40% of national product in many Western nations since 1900 
(Hanneman 1982) suggests that the effects of taxation on society have become 
more important than ever. 

This essay consists of two parts. First, I review the empirical research and 
theorizing about the determinants of tax policy in advanced capitalist societies. 
This is my primary goal because much of this literature intersects current 
debates in political sociology about the policy-making constraints that 
democratic governments face. Because much of this literature focuses on the 
national level of government, I do the same unless noted otherwise. Second, 
I explore briefly some of the literature on the effects that tax policies have on 
political rebellion, state building, economic organization, labor force partic- 
ipation, and philanthropy to illustrate the wide-ranging influence that taxation 
has on phenomena that sociologists study. A discussion of taxation in state 
socialist and post-Communist societies is beyond the scope of this paper (but 
see Campbell 1993, Eckstein 1988, Walder 1992a,b,c). 


THE DETERMINANTS OF TAXATION 


Geopolitical Conflict 


Scholars generally agree that states extract more revenue during war (Mann 
1988, Tilly 1985, 1990), but they differ about why this happens. On the one 
hand, public choice theory suggests that political elites such as elected officials 
and state bureaucrats can raise taxes because citizens become especially 
tolerant of tax increases during war. Furthermore, when peace returns, 
tolerance erodes rather slowly, particularly if political elites institute new 
spending programs that citizens support. Thus, taxes are not always rolled 
back completely to pre-war levels (Peacock & Wiseman 1961). Of course, 
military pensions, reparations, debt, and other war-related obligations also 
prevent taxes from receding fully to pre-war levels irrespective of the wishes 
of citizens (e.g. Mann 1988, p. 94). Nevertheless, in this view the tax burden 
ratchets upward from one war to the next due to changes in the political 
tolerance of citizens for taxation. 

On the other hand, neo-Marxists, concerned primarily with the conditions 
under which democratic states in advanced capitalism act autonomously from 
the interests of capital, have argued that political elites may levy high taxes 
on capital during war because capitalists cannot afford to withhold investment 
while the economy is booming due to high levels of military spending. Under 
normal conditions the threat of capital disinvestment severely constrains the 
ability of political elites to raise taxes on capital. This is because higher taxes 
would undermine business confidence and trigger capital disinvestment, 


166 CAMPBELL 


thereby boosting unemployment and sparking an electoral backlash that 
political elites would rather avoid (Block 1980). In sum, taxes rise during 
war because the economic constraints that limit tax increases slacken. 

Finally, rational choice theorists maintain that taxes rise during war not so 
much because political and economic constraints ease as because self-aggran- 
dizing political elites simply choose to discount the importance of these 
constraints. Rulers, whose survival is at stake during war, try to increase taxes 
in order to finance their defense, even if higher taxes threaten economic growth 
and risk the possibility of political discontent later (Levi 1988, pp. 32-33). 
Thus, war leads to higher taxes because it alters the priorities of political 
elites, more than those of citizens, investors, or other groups in society. 

Altogether, only the revenue ratcheting view has received sustained 
empirical attention, and the results have been mixed (e.g. Rasler & Thompson 
1985). One reason for the lack of consensus is that many of these studies 
have suffered various research design problems, such as focusing on marginal 
shifts in revenues and expenditures whereas the theory is actually about more 
fundamental structural changes (Peacock & Wiseman 1979). Another problem 
is that most researchers have not distinguished between different types of war. 
Cross-national time-series analysis reveals that revenue ratcheting is most 
likely to occur in states that are involved in global wars, rather than more 
restricted interstate wars, because global wars are more expensive to finance 
(Rasler & Thompson 1985). Indeed, this may explain why corporate tax rates 
in the United States were not significantly influenced by the Korean or 
Vietnam wars (Jacobs 1988) but rose substantially during the two World Wars 
(Campbell & Allen 1992a). More important, the deeper theoretical issues 
have not been addressed empirically. Despite their relevance for debates in 
political sociology about the determinants of state policy in general, we still 
do not know whether taxes vary with war because citizens grant politica) elites 
more leeway, because the structural dependence of the state on capital 
investment subsides, or because war fundamentally alters the decision-making 
calculus of political elites. 


Macroeconomic Conditions 


The overall level of taxation as a percentage of GNP in the United States 
since World War II varied directly with economic prosperity (Griffin et al 
1985, Hicks 1984, Jencks 1985), presumably because political elites followed 
the Keynesian prescription of using tax cuts to stimulate a sluggish economy 
(Weir & Skocpol 1985). Similarly, inasmuch as political elites must not 
promulgate excessively high taxes on capital that precipitate capital disinvest- 
ment, neo-Marxists and some others hold that elites may be able to raise taxes 
on capital during economic expansions but must reduce them during economic 
contractions (Block 1980, Lindblom 1977, Pen 1987, but see Przeworski & 
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Wallerstein 1988). In this regard, the evidence is less clear. On the one hand, 
Jacobs (1988, 1989) disputes this view, finding that corporate tax rates did 
not fall during economic contractions in the United States. However, his data 
omit the severe economic declines of the 1930s and early 1980s. On the other 
hand, comparisons of US industries (Salamon & Siegfried 1977) and advanced 
capitalist countries (Swank 1992) indicate that corporate tax rates rise with 
profits and capital investment, thereby suggesting that political elites can raise 
taxes on capital when the economy is growing, but not when it is contracting. 
Resolving this issue would contribute much to the debate about how 
democratic states gain autonomy from the influence of capital. 

However, some neo-Marxists have theorized that political elites should also 
be able to dramatically increase tax burdens on capital during severe economic 
slumps. After all, the threat of capital disinvestment subsides at these times 
insofar as much disinvestment has already occurred (Block 1980). Coupled 
with the notion that taxation has become a tool for Keynesian macroeconomic 
management, there may be a curvilinear relationship, as yet untested, whereby 
political elites gain autonomy and can impose high taxes on business and the 
rich during extreme economic booms and busts, but lose autonomy and must 
keep these taxes relatively low during routine swings of the business cycle. 

Political scientists have noted that the tax burden as a percentage of GDP 
increases as the domestic economy becomes more open, that is, dependent 
on imports and exports. This is because political elites raise taxes in order to 
finance expensive macroeconomic stabilization programs, such as social 
insurance, unemployment compensation, and job training, which they use to 
compensate for the domestic effects of fluctuations in international economic 
conditions (Cameron 1978, Schmidt 1982). However, the effect of economic 
openness varies by the type of tax. Since the mid- 1960s, for example, personal 
income tax rates have generally increased with openness in advanced capitalist 
countries, but corporate income tax rates have fallen (Swank 1992). It is not 
clear why this is the case, or how economic openness is related to other types 
of taxes. However, inasmuch as corporate tax rates fall as organized labor 
becomes weaker (Campbell & Allen 19922), the declining influence of labor 
and the breakdown of neocorporatist bargaining in many advanced capitalist 
countries during the last 20 years—itself a product of increasing economic 
globalization (Lash & Urry 1987)—may account for the decline in corporate 
tax rates. 


Fiscal Crisis 


Although reliance on deficit financing since World War II may have 
undermined the link between expenditure and revenue policies (Amenta 1989), 
some theorists insist that the link has not been completely severed. Notably, 
O'Connor (1973) argued that capitalist states tend to incur escalating deficits 
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because the state stimulates economic growth through increased spending, but 
resists raising taxes in order to preserve its legitimacy. As a result, revenues 
tend to fall short of expenditures and cause fiscal crises to which political 
elites must eventually respond (see also Bell 1976, Goldscheid 1958, 
Schumpeter 1954, for criticisms see Musgrave 1980). One solution is to raise 
taxes, although this can be difficult politically (Block 1981). Nevertheless, 
political elites in the United States have often succeeded in raising income 
tax rates in response to budget deficits (Campbell & Allen 1992a, Conlan et 
al 1990). Whether this also occurred for other types of taxes, and whether 
this reduced deficits remains to be seen. In any case, answering these questions 
would shed light on the more general controversy about the governability of 
democracies, that is, whether political elites in democracies are capable of 
making the politically unpopular decisions sometimes necessary to ensure 
economic growth and the fiscal stability of the state (e.g. Bell 1976, Ch. 6, 
Crozier et al 1975, Rose & Peters 1978). 

To summarize briefly, the literature generally suggests that geopolitical and 
fiscal crises, as well as economic conditions, create pressures on political 
elites to increase taxes. However, the extent to which they do so is variable. 
For example, tax burdens since the 1960s have been higher in countries with 
neocorporatist systems of interest representation, and where leftist parties 
often ruled. Similarly, raising taxes has been a more common way of reducing 
deficits in these countries than cutting spending (Cameron 1985). Thus, it 
stands to reason that the organizational strength of social classes and interest 
groups, the system of political representation, and the institutional structure 
of the state may also influence taxation. 


Classes and Interest Groups 


The fact that tax struggles are among the oldest forms of class struggle 
(Goldscheid 1958, p. 202) suggests that the power of classes and other interest 
groups are important determinants of taxation. Indeed, corporate tax rates are 
often lower for industries with relatively large firms because these firms have 
more resources with which to influence legislators (Jacobs 1987, Salamon & 
Siegfried 1977). However, it is not clear that a high degree of industrial 
concentration, which is also supposed to enhance the political clout of business 
by concentrating economic power in the hands of a small number of large 
corporations, leads to lower corporate tax rates (Jacobs 1988, 1987, Salamon 
& Siegfried 1977). Furthermore, although a strong labor movement may shift 
the burden of taxation toward business and the wealthy (Gough 1979, p. 129, 
Jacobs & Waldman 1983), class compromises can emerge where labor agrees 
to tax breaks for capital in the hope that this will stimulate economic growth 
and provide more jobs, higher pay, and better benefits (King 1983). Thus, 
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important questions remain about the conditions under which class power 
influences the taxation of different classes. Finding answers requires that we 
develop better measures of class capacities and pay closer attention to how 
competing factions within these classes influence revenue policy making (e.g. 
Hunter & Nelson 1989, Martin 1991). Average firm size and industrial 
concentration ratios, for example, are rather indirect measures of business 
influence that could be supplemented or replaced with more direct indicators, 
such as membership size or lobbying budgets of influential trade associations. 

Less empirical work has been done on the degree to which the relative 
organizational strength of social classes and interest groups influence overall 
levels of taxation. Following rational choice theory, if political elites seek to 
maximize revenues and societal organizations try to minimize them, then one 
would expect that in the absence of well-organized classes and interest groups 
the level of taxation would be high (Bates & Lien 1985, Brennan & Buchanan 
1980, Levi 1981, 1988). However, capital and labor have been well organized 
through national associations in Sweden, former West Germany, and Britain, 
yet the tax burden in Sweden during the early 1980s was considerably higher 
than in West Germany and Britain, where it was about the same. Furthermore, 
the tax burden was lower in the United States where capital and labor were 
much less organized (Heidenheimer et al 1983, p. 173). Of course, this may 
be due to the fact that even when groups such as these are well organized 
they will accept higher taxes in exchange for various public goods (Cameron 
1985, pp. 257-60, Lamborn 1983), but it may also reflect how the systems 
of political representation in which these groups operate influence the effects 
they have on political elites. 

Before turning to this issue, however, it 1s worth noting that little research 
has been done to determine whether other interest groups, such as those 
representing women and minorities, influence tax policy (but see Jacobs & 
Waldman 1983). Yet their influence may be just as important as that of 
social classes (Musgrave 1980). For example, the elderly living on fixed 
incomes are often well-organized politically and seek to minimize sales, 
property, and other taxes that affect their costs of living. The neglect of 
these groups, particularly in quantitative research, may stem from the fact 
that it is difficult to construct measures of their organizaticnal capacities. 
Nevertheless, until this is done we will not know how theories of taxation 
based on class analysis fare in relation to more general interest group 
theories. This would be ironic insofar as the study of revenue policy provides 
an outstanding opportunity for adjudicating between class analytic and other 
State theories. After all, there are few policy areas in which the redistributive 
interests of social classes come into sharper conflict than tax policy (Steinmo 
1988, p. 404). 
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Comparative studies suggest that governments run by leftist parties favor 
higher levels of taxation (Cameron 1978, 1985; Tufte 1978, Ch. 4, Swank 
1992) and more progressive tax systems (Heidenheimer et al 1983, pp. 
178-79) than those run by conservative parties because leftist governments 
tend to spend more and favor progressive income redistribution. Whether the 
US Democratic Party is more inclined to raise taxes or favor tax progressivity 
than the Republican Party is unclear (Campbell & Allen 1992a,b, Carruthers 
& Amenta 1991, Jackson & King 1989, Jacobs 1987, Jacobs & Waldman 
1983, Reese 1980, p. 209, Witte 1985, p. 109). In fact, one study found that 
Democratic administrations favored slightly lower tax rates for business, 
presumably because Democrats had more trouble maintaining business 
confidence and, therefore, sufficient investment than Republicans, who are 
often viewed as being more sympathetic to business (Jacobs 1988). 

If partisan control has relatively little effect on taxation in the United States, 
it is probably because the differences between parties regarding taxation are 
smal] (Page 1983, p. 44) and often only symbolic (Leff 1984, but see Witte 
1985). Indeed, the United States lacks, by West European standards, any 
significant leftist party (Cameron 1985, pp. 235-36). Of course, partisan 
effects have occurred in the United States after major electoral realignments 
where one party suddenly gained control of both the executive and legislative 
branches of government (Hansen 1983). However, realignments have been 
rare because the electoral system ensures that representatives are elected to 
Congress independently of the election of the President. In contrast, parlia- 
mentary systems often ensure that the same party or ruling coalition controls 
both branches and that revenue policy often follows party lines (Steinmo 
1989). The point is that fiscal practices are more likely to be affected by 
partisan politics in a system with several parties, where the parties are arrayed 
along a broad ideological spectrum, and where one party controls both 
branches of government, than in a two-party system, where party differences 
are slight and different parties can control different governmental branches. 
In short, the institutional arrangement of politics matters here as in other areas 
of political life (March & Olsen 1989). 

In addition to partisanship, electoral politics is frequently invoked as an 
explanation for variation in taxation. Public choice theorists have argued that 
incumbent politicians lower taxes to garner votes as elections approach (Lee 
1986, Tufte 1978). However, most research suggests that electoral cycles do 
not affect taxes (cf Campbell] & Allen 1992a, Griffin et al 1985, Jacobs 1988, 
Jencks 1985, Karran 1985, see also Brennan & Buchanan 1980, Frohlich & 
Oppenheimer 1990, Levi 1983, Mitchell 1983). This apparent discrepancy 
may be due to the possibility that political elites reduce taxes or shift some 
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of the tax burden from one group to another in order to secure election 
primarily when the level of party competition is high and elections are closely 
contested. Again, however, the evidence is inconclusive (Jacobs & Waldman 
1983, Lowery 1987). Moreover, it appears that tax policy making is more a 
matter of incrementally adjusting the existing tax code because policy makers 
generally prefer to avoid upsetting voters with radical tax reforms. Thus, 
fundamental changes in taxation are rare (Campbell & Allen 1992a, Reese 
1980, Rose & Karran 1987, Witte 1985). 

Indeed, much work remains to be done to determine the effects of politics 
on taxation within and across countries. In particular, our understanding of 
the conditions that promote or inhibit partisanship 1s limited, as is our 
knowledge of the conditions under which incrementalism gives way to major 
transformations in taxation. Furthermore, even if public cboice theory is 
correct that electoral cycles and levels of party competition influence tax 
policy in the United States, it is not clear that these factors will be important 
in other countries with different party and electoral systems and in which 
politicians running for office tend to support programmatic party ideals more 
fervently than they do in the United States. Hence, studying the political 
determinants of taxation can be especially useful in adjudicating between 
public choice and institutional theories of policy making. 


State Structure 


There is considerable debate in political sociology about the degree to which 
the structure of the state apparatus influences policy making (Skocpol 1985). 
Some of this work on taxes comes from an institutionalist perspective. For 
example, one study found that in the United States, where tax policy making 
is accessible to many interest groups, a relatively light tax burden results 
because these groups force the creation of thousands of tax loopholes through 
which revenue is lost. On the other hand, where revenue policy making is 
less accessible to pluralist influence, as in Sweden, the tax burden is much 
greater (Steinmo 1989). However, few others have tried to determine the 
degree to which the accessibility of political elites to groups outside the state 
affects taxation across states. 

Similarly, although variation in this dimension of the state apparatus has 
facilitated change in other policy areas (e.g. Campbell 1988, Skowronek 
1982), there is little longitudinal research on how variation in the accessibility 
of political elites has affected taxation within countries over time. For 
example, although institutional changes afforded interest groups greater access 
to tax policy making in the United States during the mid-1970s (Reese 1980, 
pp. 120-24), thereby increasing the number of tax loopholes (Witte 1985, p. 
330) and weakening barriers against revenue loss (Webber & Wildavsky 1986, 
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p. 531), no one has determined how this actually affected the revenue stream. 
Whether similar changes have had effects in other countries is also unclear. 
In addition to the issue of institutional accessibility, rational choice theorists 
hold that the more the authority to tax is centralized, the more difficult it is 
for taxpayers to avoid paying taxes by moving to other geographical areas 
and, given the revenue maximizing nature of political elites, the more revenue 
the state will collect (Brennan & Buchanan 1980, but see Musgrave 1981). 
Although evidence from state and local governments in the United States is 
inconclusive (Hansen 1983, pp. 40-45, Oates 1985), the basic idea that tax 
policy is influenced by the capacity of political elites to actually collect taxes 
(Bates & Lien 1985) has stimulated important theoretical innovations. 
Notably, Levi (1981, 1988) argued that the degree to which the state imposes 
taxes on society is determined largely by its capacity to negotiate taxes with 
taxpayers, measure taxable economic activity, and monitor those who are 
taxed in order to ensure compliance. Hence, political elites will tax only 
insofar as the transaction costs of negotiation, measurement, and monitoring 
are justified by the revenues collected. However, because she does not test 
the theory (Levi 1988, p. 6), and because few other researchers have analyzed 
it empirically, we cannot be sure whether this transaction-cost approach to 
taxation holds. Measuring transaction costs is a big obstacle for testing this 
theory, as it has been in economic and organizational sociology (Perrow 1986, 
Ch. 7). However it is a necessary obstacle to overcome if scholars are to 
determine the value of more general theories of the state that rely on this 
approach (e.g. North 1981) and test further the general claim that policy 
formation is constrained by the structural capacity of political elites to 
implement policy (e.g. Skocpol 1985, Pressman & Wildavsky 1973). 


Ideology 


One theoretical objective of the rational choice approach is to “bring people 
back in” to the study of tax policy (Levi 1988, p. 7). However, it is ironic 
that rational choice theory tends to neglect the possibility that some political 
elites may be more inclined to maximize revenues than others for ideological 
reasons. Although scholars claim that ideological factors influence taxation 
(Braun 1975, pp. 313-17, Musgrave 1969, p. 135), rarely has anyone 
explored systematically how ideology, culture, or values affect tax policy, 
except insofar as the ideological differences among political parties are 
concerned. One exception is Webber & Wildavsky (1986) who argue that 
political culture determines the type of tax system different countries adopt 
(see also Pierce 1971). For example, countries whose political regimes 
(culture) are organized around markets favor individualism over centralized 
authority and, therefore, prefer low taxes; countries organized around central- 
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ized hierarchies favor collectivism over individualism and prefer high taxes 
(see also Musgrave 1969, p. 132, Tarschys 1975, p. 24). 

However, testing such theories is often difficult. First, key concepts such 
as culture, which Webber & Wildavsky (1986, p. 586) define as “shared 
values legitimating desired social practices,” often are specified only vaguely. 
Second, it is not clear whether the values in question are those of political 
elites or citizens in general. Finally, the causal mechanisms taat link culture 
to policy outcomes remain poorly detailed (Rose 1985). For instance, although 
they argue that market regimes based on individualism tax as little as possible 
in order to preserve individual autonomy and reduce the need for authority, 
Webber & Wildavsky do not demonstrate convincingly that this was a key 
motivation of policy makers in the United States and similar countries. One 
way to specify these linkages and remain true to the cultural thesis would be 
to argue that political parties are the bearers of political culture and serve as 
the transmission belt linking citizen preferences with policy outcomes. 
However, the declining significance of programmatic political parties and 
party discipline (Bell 1976, pp. 247-48, Skowronek 1982) suggests that this 
form of political culture has become a relatively less important determinant 
of tax policy. 

Ideology may nevertheless be important in policy making (e.g. Quirk 1990, 
Skowronek 1982). The point is that theories about the influence of ideology, 
values, and culture must be specified more precisely before real headway can 
be made in determining how much impact these factors have on taxation. 
Until this is done it will be very difficult to determine, for example, the degree 
to which political elites are revenue maximizers or are motivated by a wider 
set of ideological preferences. 


Summary: A Conceptual Model of Taxation 


A complete understanding of taxation requires that we identify the interaction 
between variables operating at different levels of analysis (Levi 1983). Based 
on the literature reviewed above, it is possible to construct a conceptual model 
to guide future theorizing and research. The model is summarized in Figure 
1 and seeks to explain major transformations 1n levels of taxation as well as 
the structure of tax systems. It is probably less useful for explaining the more 
routine, incremental adjustments to taxation. It begins with ihe proposition 
that pressures for transformation often stem from geopolitical, economic, or 
fiscal crises to which political elites respond by altering tax policies (Higgs 
1987, Sharkansky 1969, Ch. 5, Witte 1985). However, the effect of crisis on 
taxation is mediated in complex ways by a variety of additional factors. To 
begin with, classes and other social groups often seek to influence how 
political elites respond to crises. Groups have different levels of tax tolerance 


174 CAMPBELL 


POLITICAL RESPONSE 


Change level of taxation 

Change taxation system 

Change distribution of tax 
burden 


POLITICAL 
REPRESENTATION 


Political party & electoral 
system 

System of interest 
representation 

Party control 





Figure 1 Conceptual model of the determination of taxation 


and interests in different forms of taxation (e.g. progressive or regressive). 
Thus, the balance of power among these groups will influence how political 
elites craft tax policies. 

However, the influence of these groups on political elites is itself mediated 
by systems of political representation. That is, groups are provided with 
different means to affect tax policy making by the structure of political parties 
and electoral representation, the relative presence of pluralist or neocorporatist 
systems of interest representation, as well as shifting control of government 
by different political parties. The institutional arrangement of the state 
apparatus further mediates the influence of different groups on political elites. 
Thus, the degree to which political elites are accessible to public pressure and 
have the capacity to collect taxes determines the degree to which they will 
be able to act according to their own ideological preferences, or those of other 
groups in society. 

Of course, more variables and causal arrows could be added. Societal 
groups may try to influence revenue policy to obtain relief from economic 
problems (Karran 1985), inequality (North 1985, p. 392), or urbanization 
(Wagner 1958). The organization of economic activity may also affect taxation 
(Musgrave 1969, Ch. 5, North 1985, Wagner 1958). For example, in 
pre-modern trading economies states relied heavily on the extraction of goods 
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and services via tariffs, but in modern industrial societies where economic 
activity is monetized, states depend more on the extraction of money via 
income and other taxes (Tarschys 1988). Moreover, there may be feedbacks 
where, for instance, political elites influence the mobilization of social groups 
in the first place, such as when the Reagan administration organized business 
groups to lobby Congress to pass the 1981 tax cut (Martin 1991). 

Although the literature has reached a point where scholars are beginning 
to devise such models (e.g. Campbell & Allen 1992a, Karran 1985, Levi 
1988, Swank 1992), much research utilizes either a few qualitative case 
studies, or cross-sectional quantitative data, rather than historical and 
longitudinal evidence. Furthermore, because the determinants of taxation 
may be historically contingent (King 1983), researchers need to be sensitive 
to possible period effects that can only be identified through historical 
analysis (e.g. Campbell & Allen 1992a, Martin 1989, Schmidt 1982, Swank 
1992). Indeed, the longitudinal study of taxation provides an excellent 
opportunity to determine whether transhistorical theories of political change 
need to be modified to account for period effects—a methodological issue 
of great controversy among sociologists (Isaac & Griffin 1989, Kiser & 
Hechter 1991). 


THE CONSEQUENCES OF TAXATION 


Although taxation produces a variety of important sociological consequences, 
this aspect of fiscal sociology is much less developed than that which examines 
the determinants of tax policy. What follows are brief discussions of some of 
the more promising aspects of this work as they have started to emerge. 


Political Rebellion and Revolution 


Taxation has often contributed to violent political rebellions and revolutions, 
although there is debate about why this occurs. Some scholars suggest that 
large and unexpected tax increases have been one of the most important stimuli 
to popular rebellion in Western history, particularly when people found it 
impossible to pay the taxes demanded by government (Tilly 1978, p. 205). 
However, other researchers argue that excessively rigid tax requirements, such 
as head taxes or fixed land taxes, rather than tax increases per se, sparked 
rebellions when they threatened to undermine subsistence, as occurred, for 
instance, during crop failures and other economic calamities in peasant 
societies (Migdal 1974, Scott 1976). Moreover, exchange theorists hold that 
taxation elicits protest and rebellion from lower classes the longer these taxes 
are maintained without government programs that legitimize them (Lamborn 
1983). Of course, increased taxation may also trigger protest among the upper 
classes. If this inhibits tax collection enough, it can precipitate state fiscal 
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crises and foster revolution and state breakdown (Goldstone 1991, Skocpol 
1979). In this sense, revolution is caused by under taxation, rather than 
excessively high or rigid taxation (Goldstone 1991, p. 461). The point is that 
theories of revolution and rebellion based on an analysis of the effects of the 
state’s fiscal practices provide an important alternative to more conventional 
theories that focus on structural strain, relative deprivation, modernization, 
class action, and other dynamics. 

Of course, taxation has also triggered a variety of less dramatic anti-tax 
social movements (Adams 1984), such as those of the late 1970s in the United 
States where grassroots organizations tried to restrain the taxing and spending 
powers of state and local governments. Some scholars attribute these 
movements, notably California’s Proposition 13, to increasing public dissat- 
isfaction with the uncontrolled growth of government (Adams 1984, Rabushka 
& Ryan 1982). Others find little evidence for this (Freiman & Grasso 1982, 
Musgrave 1979, Peretz 1982) and maintain that anti-tax movements stemmed 
from other concerns, such as rising property tax rates (Kuttner 1980) and 
political leaders who were unresponsive to the tax-related problems of their 
constituents (Lowery & Sigelman 1981). More important from a social 
movements perspective, research indicates that grievances did not constitute 
a sufficient condition for these mobilizations and that additional factors were 
involved (Lowery & Sigelman 1981). Notably, Clarence Lo (1990) showed 
how a coalition of middle- and upper-middle income homeowners and small 
businesses had to utilize a variety of political and organizational resources 1n 
order to get Proposition 13 passed. Furthermore, because affluent homeowners 
and small business owners dominated the coalition, the initiative resulted in 
a regressive redistribution of the property tax burden. Whether this kind of 
initiative succeeds in limiting taxing and spending during periods of economic 
contraction when the demand for government services grows is basically 
unclear (Cox & Lowery 1990, Lowery 1983). Nevertheless, because anti-tax 
movements are so common, they provide empirical cases that can help 
adjudicate among social movement theories that contend, for example, over 
the relative influence of grievances, resources, political opportunity structures, 
and class and interest group dynamics in the mobilization process and its 
outcomes. 


State Building 


Historically, the taxes against which groups revolted often reflected the 
efforts of political elites in newly forming states to expand their power 
(Tily 1978, p. 230), usually in order to defeat foreign or domestic 
competitors and control the rivals of their clients (Tilly 1985). In fact, 
taxation has often been a key to successful state building (Ardant 1975, 
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Braun 1975), and has been so important that the survival of states (Mann 
1988, p. 109), as well as their rise and fall as world powers (Lamborn 
1983), has depended on the ability of political elites to extract taxes from 
society. However, in addition to providing finances for creating military 
and other state agencies, taxation has profoundly affected state building in 
other ways. In order to collect taxes, political elites had to build adminis- 
trative and enforcement apparatuses (Tilly 1975a, 1985), promulgate tax 
laws (Ardant 1975), and, in order to generate consent for taxation, create 
avenues of political participation, such as courts, prefectures, estates, and 
parliaments (Ames & Rapp 1977, p. 164, Ardant 1975, Braun 1975, Poggi 
1978, p. 52). For instance, constitutionalist states emerged where the upper 
classes agreed to shoulder a major portion of the tax burden in exchange 
for certain political rights, but where it was more difficult for political elites 
to strike such bargains, they created absolutist states to increase the tax 
burden on the lower classes instead (Mann 1988). 

Both the form and the level of taxation influences state building. In Latin 
America, for instance, when the state depends heavily on tbe taxation of 
international trade, rather than domestic economic activity, the state apparatus 
tends to be less developed because the collection of tariffs and duties does 
not require an elaborate fiscal structure. Furthermore, following exchange 
theory, when political elites refrain from taxing the domestic population they 
do not need to create as many agencies to deliver public goods as they would 
if they had to obtain the compliance of local tax payers (Gallo 1991). 
Moreover, if the level of taxation remains low, the state assumes little 
responsibility for economic management and does not need to establish 
agencies to manage regulatory, monetary, and other economic matters (Ardant 
1975, p. 228, Poggi 1978, p. 78). 

This does not mean that taxation is the only determinant of state-building. 
Indeed, class structure, the relationship of classes to political elites, the level 
of economic development and concentration, and other factors are also 
important (Mann 1988, Tilly 1990). Furthermore, the need to finance the 
development of military and other state activities may precipitate the con- 
struction of the political machinery necessary for borrowing as well as taxing 
(e.g. Bensel 1990, Brewer 1989, Dickson 1967). The point is that, in contrast 
to those who view state building as being driven primarily by economic and 
class-based forces (Anderson 1974, Wallerstein 1976), an alternative theory 
of state building recognizes that the efforts of political elites to alter the level 
and form of taxation is of fundamental importance. To sharpen the point, if 
it is true that theories of state breakdown must be based, at least in part, on 
an understanding of the fiscal problems of the state (Goldstone 1991), then 
it may also be true for theories of state building. 
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Economic Organization 


Taxation also has important consequences for the organization of economic 
activity. For example, after World War II the Christian Democratic govern- 
ment in Italy deliberately encouraged the growth of an extensive sector of 
small firms, linked through flexible subcontracting arrangements, by offering 
special tax breaks and other benefits to entrepreneurs, and by taking these 
benefits away if firms became too large (Lazerson 1988, Weiss 1988, 1984, 
p. 224). In contrast, the US government manipulated tariff policies during 
the nineteenth century in ways that contributed to the decline of entrepreneurial 
firms and the rise of large, concentrated companies and economic sectors 
(Lindberg & Campbell 1991, p. 364). Thus, economic and organizational 
sociologists who are interested in the origins of markets, hierarchies, and 
economic networks should also be interested in the effects of revenue policy, 
particularly insofar as they provide a fiscal dynamic for organizational change 
that differs from those offered by transaction-cost, population ecology, 
resource-dependence, and other theories. 

Additionally, taxation influences the location of economic activity, partic- 
ularly insofar as capital moves to areas with low tax rates (Bates & Lien 
1985). Indeed, low taxes in the southern United States contributed to the 
relocation of businesses and industries to that region after World War II 
(Bluestone & Harrison 1982, Campbell & Lindberg 1990, Markusen 1987, 
Newman 1984). Sub-national governments have long understood how taxes 
can influence the location of firms and have competed for potential investors 
by trying to offer the most lenient tax packages (Kuttner 1980, Ch. 10, 
Scheiber 1978, p. 636, 1980, p. 687). However, because much of our 
knowledge of these processes is based on qualitative or anecdotal evidence 
(but see Newman 1984, Papke & Papke 1986), more empirical work is needed 
to determine the extent to which regional variation in taxation influences the 
location of economic activity. Such research could provide important insights 
for urban sociologists and geographers concerning the relative merits of 
ecology and political economy theories of urban and regional development. 


Labor Force Participation 


Sociologists and other students of labor markets have long recognized how 
life cycles, demographics, and the supply and demand of labor influence labor 
force participation, particularly among women. However, taxation may also 
be important. For example, the negative income tax experiments conducted 
in the 1970s in the United States suggested that taxation directly affects labor 
force participation, especially among young people (Ellwood & Summers 
1986). Furthermore, although individual income tax rates for married workers 
in the United States, Britain, and other Western countries have generally been 
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lower than rates for single workers, married workers have been taxed on their 
combined earnings. This often puts married workers into higher tax brackets, 
increases their tax burden overall, and rewards families with one rather than 
two breadwinners (Leader 1983, Steinberg & Cook 1988). Because wives 
often earn less than their husbands it provides disincentives for wives to join 
the labor force (Bergmann 1986, pp. 218-20, Steinberg & Cook 1988, p. 
319). The problem is particularly acute in countries with highly progressive 
income tax systems, such as Sweden. In fact, Sweden eliminated joint tax 
accounts in 1971 deliberately to encourage married women to enter the labor 
force at a time when Sweden faced a severe labor shortage. Since then, the 
earned incomes of married couples have been taxed separately and, therefore, 
at lower rates (Adams & Winston 1980, p. 170). In contrast, Japanese tax 
laws discourage women's labor force participation by imposing stiff penalties 
on most of the income wives contribute to the family (O'Kelly & Carney 
1986). 

How much tax systems actually affect labor force participation is not 
entirely clear. Traditionally, economists argued that income and payroll taxes 
did not affect the supply of labor (e.g. Pechman 1977, Ch. 3). However, 
Hausman (1981) estimated that direct taxes on income and earnings reduced 
the number of hours worked by husbands and especially wives, and that a 
proportional rather than a progressive tax system would eliminate much of 
this distortion in the labor supply. Others have corroborated the general thrust 
of Hausman's findings for the United States and other Western countries 
(Blomquist & Hansson-Brusewitz 1990, Bourguignon & Magnac 1990, 
Colombino & del Boca 1990), but have noted that it 1s particularly difficult 
to specify with precision the effects the tax system has on the labor supply 
of married women (Triest 1990). In addition to refining methodological 
techniques, further research is required to determine, for instance, whether 
the effects of tax systems on labor force participation in the United States 
have been mitigated by the fact that the income of wives seems to have become 
increasingly necessary in order to maintain a family income formerly sustained 
by one breadwinner (Mischel & Frankel 1991, p. 37). In any case, studies 
of labor force participation, based on an analysis of fiscal policy, may provide 
a useful addition to the literature on labor markets and labor force participa- 
tion. 


Philanthropy 


Researchers recognize that philanthropy is affected by taxation, particularly 
insofar as the federal tax code provides deductions for charitable contributions. 
Virtually all studies of the effects of taxation on philanthropy in the United 
States concur that as the deduction for charitable contributions increases, so 
does charitable giving among most individuals (Clotfelter 1985, Clotfelter & 
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Steuerle 1981, Jencks 1987). Most of this work has been done by economists 
who pay relatively little attention to possible sociological determinants of 
charitable behavior, except for a few variables such as age, marital status, 
and education. This is an important omission to the extent that donors also 
contribute in order to enhance personal or family prestige, preserve family 
unity, memorialize themselves or other family members, support causes in 
which they believe strongly, and because they are embedded in personal 
networks that promote giving (Galaskiewicz 1985, Ostrower 1987). Never- 
theless, our knowledge is limited about the relative impact of taxation versus 
other sociological variables on philanthropic behavior (Schiff 1989, Simon 
1987, p. 84). Making such comparisons would shed light on the broader 
debate between economic sociologists, economists, and others about the 
conditions under which individuals are driven by self-interest or more altruistic 
motives (Etzioni 1988). 

It is worth noting that wealthy Americans established many large philan- 
thropic organizations and charitable trusts, such as the Ford, MacArthur, and 
Hearst foundations, in the first place to enable their descendants to avoid steep 
estate taxes and retain control of their family fortunes (Allen 1987). 
Economists have long been interested in how the tax code affects the 
distribution of income and wealth per se (Katz et al 1983, Pechman 1986). 
However, sociologists can make a unique contribution by analyzing how the 
tax code affects the creation of foundations, trusts, and other institutions that 
are designed to mitigate against the redistributive effects of tax policy and, 
therefore, contribute to the preservation of inequality. 


CONCLUSION 


Of course taxation may affect class differences (Mann 1943), corporate 
behavior (Shoven & Waldfogel 1990), economic activity (Aaron & Pechman 
1981), and many other social phenomena. In addition, the effects of taxation 
on family structure, marriage and divorce rates, fertility, and migration are 
areas ripe for study. More important, fiscal sociology can shed considerable 
light on debates of interest to political sociologists, including disputes over 
the conditions affecting state autonomy, the governability of democracies, 
state building, revolutions and social movements, and policy making in 
general. Moreover, fiscal sociology provides an excellent vehicle for exam- 
ining competing analytic frameworks, such as public choice, neo-Marxist, 
institutional, transaction-cost, exchange, and rational choice theories. Attend- 
ing to the effects of taxation and the fiscal practices of the state can also 
provide insights for economic sociology, organizational sociology, and other 
subfields of the discipline. In sum, although fiscal sociology may have been 
in its infancy only a short time ago, today it is approaching maturity in the 
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sense that it is developing an intellectual identity and research agenda of its 


own, as Schumpeter wished. 
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Abstract 


I review recent studies of work and occupations. Most of this work proceeds 
at the individual level, studying individual characteristics of workers, qualities 
of the work experience, and, to a lesser extent, stages of the work experience. 
Structural analysis is less common and often treats structural phenomena as 
aggregates rather than emergents, except in the area of labor relations. A 
substantial literature—probably a third of the total—examines particular 
occupations. Ín general the literature is divided into two "sides"—one focused 
on gender, inequality, and career/life cycle issues, the other on unions, and 
industrial and labor relations. Between these are smaller foci on theoretical 
issues and on general structures of work. I conclude that with the possible 
exceptions of Marxism and the study of professions, subfields of work and 
occupations lack the synthetic theory that would enable synthesis of empirical 
results. I also consider the twofold role of politicization in the area: the positive 
role of driving empirical investigation of new areas, the negative one of taking 
its own politics as unproblematic. 


INTRODUCTION 


An Annual Review author must cheat either breadth or depth. I cheat depth. 
To limit my topic to occupations alone is absurd. Changes in occupations 
cannot be construed without the work system that enfolds them. That normal 
sociology isolates social facts from their contexts—as status attainment 
separated achievement from structural change—only underscores the point, 
for what has stunted sociological knowledge if not this isolation? I therefore 
begin by situating the work and occupations (WO) literature historically and 
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disciplinarily. An exploratory quantitative analysis of the field then leads into 
reviews of selected areas within it. 


THE LOCATION OF THE WORK AND OCCUPATIONS 
LITERATURE 


The modern sociological literature on WO has two ancestries. The first 
includes Everett Hughes and his students. Hughesian studies of work rested 
on field study, Chicago-style ecological institutionalism, and a social psycho- 
logical focus inherited from W. I. Thomas and Robert Park. The ecological 
institutionalism combined with field study in the literature’s long-familiar 
emphasis on professions, while the social psychological concern ultimately 
bore fruit in its emphasis on job satisfaction. 

The Hughes tradition relates ambiguously to the other line of sociological 
inquiry into work—what is loosely called industrial sociology. [This is the 
tradition that is seen by Simpson (1989) as the core of WO.] Industrial 
sociology comprises diverse work institutionally denved from the human 
relations school of management; it takes formal shape in the field of industrial 
relations (IR). Methods differ between the IR and the Hughesian traditions; 
mutual citation is low and theoretical integration unusual. Few topics have 
drawn sustained attention from both literatures. 

The split is easily understood. The Hughesians saw industrial sociology as 
“applied sociology,” an orientation from which they had begun withdrawing 
by the 1950s (Smigel 1954). Industrial sociology was also closer to the 
organizations and bureaucracy literatures, always somewhat prescriptive and 
gradually moving into applied settings (i.e. business schools) throughout the 
postwar period. Writings on workplace participation well illustrate these 
differences. Driven by applied concerns and tied to formal theories of 
organization, this work has been ignored in the sociological literature on WO, 
although it is central to the IR literature. 

In the years immediately after World War II, professions were the central 
focus of the WO literature. (For histories, see Cohen 1983, Smigel 1954, 
Roth et al 1973, Hall 1983.) There was little on blue collar work. The shift 
towards psychological concerns—job satisfaction, alienation, and so on—took 
place in the 1950s. Like the parallel rise of the new statistics, this shift was 
part of the gradual move away from institutional or structural accounts (of 
work as of most other things) toward individualistic ones. Simpson (1989) 
writes that 1950s research often portrayed workers as controlled by larger 
structural forces rather than as independently acting, the latter image being 
characteristic of earlier, observation-based research. She infers that the 1950s 
therefore inaugurated a period of serious structural analysis. Quite the 
contrary. The abstractions of Parsons simply screened the refocusing of studies 
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of work on the individual level. Structural analysis now meant studying 
individual flows within a constant (controlling) structure (changes in employ- 
ment levels, mobility in socioeconomic hierarchies, etc.) This change—from 
conceiving structures as constructed, active, and contingent to conceiving 
them as dead but constraining—was driven largely by the methodological 
paradigm, as I have argued elsewhere (Abbott 1992). The 1950s' new interest 
in careers also grew from this individualism in structuralist disguise. 

These changes were consecrated in the 1960s by the rise of status attainment 
and the publication of The American Occupational Structure (Blau & Duncan 
1967). By assuming well-defined, constant occupations with well-defined 
statuses (artifactually shown to be "constant" by Hodge et al 1963—on which 
see Burrage & Corry 1981 and Coxon & Davies 1986), these new analyses 
of stratification negated en passant the ecological, processual approach to 
occupations, and indeed to work in general, allowing only for demographic 
movement within fixed structure. The status attainment tradition also firmly 
established occupational status mobility as a, or the, central substantive topic 
of the sociology of WO, encircling it with a set of individual determinants, 
above all gender, which became one of the two central topics of WO research 
by the late 1970s (Hall 1983). 

By the 1980s, the evolving WO tradition had come to look much as it does 
today. The only institutional subfield of real strength was the study of 
professions. To be sure, Braverman (1974) had resuscitated structural analysis 
in his justly celebrated book. But the DOT’s scales so transcendentalized 
“skill” that the ensuing research ignored changing structure altogether. In such 
an environment, individual-level topics like mobility, status, satisfaction, 
commitment, aspirations, and choice elicited most of the research. The most 
visible subareas of WO research were mobility and gender, the latter 
understood basically in status attainment terms even after Hochschild’s 
brilliant book (1983). There were structural-level conceptions, to be sure, like 
labor markets. But labor markets were defined in terms of individual 
properties—mobility chances between areas—rather than conceptualized as 
emergent structures (as markets were more generally by structuralists like 
Harrison White 1981). 

The WO literature has generally been isolated from research on work in 
other disciplines. Thus, the enormous (1000 articles per year) economic 
literature is seen as excessively abstract, although eventually the human capital 
approach it pioneered became a competitor of status attainment. Similarly, 
although a few social histories beguiled sociologists—Hareven's Family Time 
and Industrial Time (1982) and Tilly & Scott’s Women, Work, and Family 
(1978), for example—the majority were ignored. For example, uncited in the 
WO literature in sociology (in the 1990 Social Science Citation Index) were 
Montgomery’s The Fall of the House of Labor (1987), Graebner’s History of 
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Retirement (1980), Keyssar’s Out of Work (1986), Licht’s Working for the 
Railroad (1983), and Stearns’s Paths to Authority (1978). Similarly ignored 
were historical studies of family and external divisions of labor such as Pahl’s 
Divisions of Labour (1984), Hall et al’s Like a Family (1987), and Lowe’s 
Women in the Administrative Revolution (1987). Helmbold & Schofield (1989) 
and Rose (1986) give a sense of the size and richness of this unknown 
literature. 

The ignorance of both gender and labor history in WO stems from 
longstanding anti-structural bias. Union-membership, like gender, was easily 
treated as a variable, metamorphosed from a social or cultural construct into 
an attribute of an individual. Only the institutionalist students of professions 
made serious use of the new historical work (as in Larson 1977 or Abbott 
1988). Indeed, most sociologists of work who did use the historical literatures 
were historical sociologists, identified with the relatively theoretical, struc- 
tural, and somewhat leftist cast of that field. 

In short, the sociology of WO has for the last 20 years pursued a fairly 
narrow range of topics. It has focused on individual behavior and its immediate 
contexts, looking at psychological, personal, and social antecedents and 
consequences of work behavior. It has largely ignored other bodies of inquiry 
into work. There have been few attempts at general theoretical analysis outside 
Marxist writings and perhaps the sociology of professions. It is a sad fact that 
much or most of the exciting study of work today happens outside sociology’s 
provenance and even its interest. 


THE TOPOLOGY OF THE WORK AND OCCUPATIONS 
LITERATURE 


Nonetheless, sociology does produce an awesome amount on work. Socio- 
logical Abstracts (SA) lists about 500 articles and 70 dissertations per year 
on WO broadly defined, either under “sociology of occupations and profes- 
sions" or under "jobs, work organization, workplaces, and unions." I had a 
research assistant code these articles for the entire years 1990 and 1991. For 
each article, I have its general character (theoretical or empirical), country of 
major concern (if empirical), methods (if given), number of cases (if given) 
and a set of major keywords covering the central topics. Since I believe in 
internationalism, I did not rule out foreign language articles. Unless otherwise 
noted, descriptive remarks following are based on this data. 

Slightly more than half the articles abstracted in SA appear in sociology 
journals. Another 9% appear in psychology journals, 7% in economics 
journals, 695 in business journals (where I counted the labor relations 
journals), and 2-4% each in health, law, education, and anthropological 
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journals. Another 14% appear in a variety of other areas—mostly interdisci- 
plinary journals like Annales ESC. 

About two thirds of these articles are empirical. The proportion of 
theoretical work is highest among the economics articles (45% theoretical), 
then sociology (40% theoretical), but much lower in business (25%) and 
psychology (10%). The typical empirical sample 1s largest in business studies 
(n = 450), then sociology (350), economics (250), and psychology, education, 
and anthropology (150). Sociology produces most of the really large studies, 
perhaps because I have included demography in sociology. But the modal 
study in all fields examines 100 or 200 doctors or teachers or some other type 
of worker. 

Fields use diverse data. Economists rely more on published information 
(45%) than on surveys and questionnaires (25%) or interviews (20%). By 
contrast, sociologists rely first on surveys and questionnaires (4596), then on 
published information (30%), then on interviews (15%). Business and 
psychology both rely mainly on surveys and questionnaires (6596), with the 
rest mostly from published information in business and from interviews in 
psychology. Hughesian-style field study has nearly disappeared. Psychology 
leads with about 10% observation studies, while sociology, economics, and 
business all hover around 5%. It is in the other, minor literatures that 
observation survives; about 25% of empirical articles on werk in anthropo- 
logical or educational journals involve observation. 

Most empirical studies consider one country. There are 40 comparative 
studies (out of 663 empirical studies), 25 of them two-country. But the 
literature is cosmopolitan (partly because I regard the overseas literature as 
relevant, of course). Of 523 studies with 1dentifiably national datasets, only 
40% (205) primarily concern the United States. Another 12% (64) concern 
the United Kingdom, followed by Canada (6%, 31), Germany (4%, 23), the 
former USSR (4%, 19), Australia and Finland (3%, 14), the Netherlands (3%, 
13), and other countries. Language barners cause the surprising absence of 
Japan as a principal country of analysis (a total of 4 articles in these 1100). 

I now turn to topics studied. That the importance of topics varies somewhat 
across these literatures tells us much about how Sociological Abstracts selects 
articles for inclusion, a selection that I think reflects the tastes of the WO 
literature. (The reader should recall that an article can have several topics. 
Also, I used general headings for unspecified or “mixed” topics; these are not 
included in the lists immediately following.) In sociology, the most important 
single topic is gender, which is considered by about 20% of the sociology 
articles, much as in Hall’s (1983) figures for the early 1980s. The next most 
important substantive topics are individual status mobility, unions, and income 
(7% each), labor and social control of work (6% each), work organizations 
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and labor markets (5% each), and race, inequality in work experience, 
unemployment, technology, and manufacturing employment (4% each). 

In other fields concentration is much greater, reflecting selection. Nearly 
half (42%) of the economics articles are on unions, another quarter (22%) 
concern IR, and about 10% each concern unemployment and collective 
bargaining. (The overall economic literature on work is distributed quite 
differently.) The business literature is ransacked for the same topics, unions 
(22% of business articles), wages and management (each 13%), and collective 
bargaining (11%). In psychology, gender is again the top topic (16% of 
psychology articles), followed closely by stress (15%), then by job satisfaction 
(11%), psychological consequences of work (9%), individual status mobility, 
doctors, and unemployment (7% each). Since the chosen collateral literatures 
reflect sociology’s own major concerns in the study of work, I shall henceforth 
discuss the overall SA sample as a coherent whole. 

I begin with a discussion of the comparative importance of topics. There 
are four general areas: general issues, individual level topics, structural level 
topics, and particular occupations or areas of employment. 

Theory is the most important general issue, although a relatively small 
fraction (about 16%) of the literature concerns theoretical issues. (In what 
follows the percentages are always percentages of all articles, n = 1100.) 
Most of these articles are actually theoretical analyses of more specific areas; 
articles about theory alone are unusual. There is also a small literature (5% 
of the total) dealing with cultural images of work, often closely tied to 
theoretical analyses. The cultural literature is very diffuse, spread over many 
areas of substantive concern, rather than concentrated into “a literature” on 
the symbolic structure of work. 

The second general area is research at the individual level. The first major 
subcategory here is study of individual characteristics of workers, in which 
most effort (22%) is concentrated on ascribed characteristics (like gender, 
race, age) rather than achieved ones (education, skill, and sexual orienta- 
tion—3%.) Education appears often as a control variable, but it is seldom 
more than that. Age (2%), too, is a control variable rather than a central focus 
in most cases. 

Among articles on (individual) qualities of the work experience, the major 
interests are psychological (including stress, motivation, and job satisfaction— 
10%) and economic (wages and fringe benefits—6%). There are also concerns 
for general qualities of work (working conditions and inequality in general— 
5%, mostly on inequality) and for health consequences of work (5%). A 
smaller fraction (396) concerns social qualities of work (e.g. lifestyle). 

Relatively less attention is paid to stages of the individual work experience. 
Here the major topic is unemployment (596 — 51 articles), followed distantly 
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by careers (29), career choice (20), and employment opportunities (14). 
Surprisingly, there is little about layoffs or retirement. 

A third general area 1s structural analysis. A substantial part of the literature 
considers some general structure (e.g. labor, management, occupations, 
bureaucracy, markets—1796), general process (e.g. individual mobility, 
migration, labor movement—10%), or general indicator (e.g. productivity, 
industrial dominance-—-5%) of the work system. The important structures are 
labor, management, and (individualistically conceived) labor markets. The 
important process is individual status mobility. The important indicator is 
productivity. The shadow of The American Occupational Structure falls even 
on this "structural" level. 

Only about 1096 of the literature concerns the division of labor. Most of 
this work is about technology (40 articles) on the one hand or about the twin 
divisions of labor of home and work (26 articles) on the other. À slightly 
larger fraction of the literature (14%) studies the social control of work, much 
of it covering several issues at once (64 articles), although there are specific 
foci on ethics (40) and professionalism (18). Surprisingly, there 1s little or no 
focus on the temporal structure of work (1%), including such topics as 
flex-time, part-time work, moonlighting, and job-sharing. There is much 
speculation about the role of temporal structure, particularly in relation to the 
gender wage gap, but little research. 

Finally, in this structural area, about one fifth (18%) of the literature deals 
with topics in what is usually called industrial and labor relations. Unions 
figure in one tenth (113) of the articles (second only to gender), but there are 
also many articles on IR generally (52), and not a few on collective bargaining 
(21) and strikes (20). 

The gap between structure and individual characteristics is bridged in part 
by my fourth general area, writing that considers a particular occupation, 
occupation type, or sector of employment. About a third of articles concern 
a limited group of workers. Some 10% concern a general category like white 
or blue collar or professionals. Fully a quarter concern a particular type of 
occupation. Of these the most common are doctors (54 articles), lawyers (27), 
nurses (25), teachers (22), university faculty (22), engineers (16), social 
workers (13), miners (12), scientists (9), and journalists (6). The literature’s 
love of professions, indeed of itself (academics tie for fourth), is untarnished 
by time. 

It is much less common (15%) to focus on a specific area of employment. 
Of these the most common is manufacturing (56 articles), with academia (24) 
second, and government (20) third. Home work (9 articles) nearly disappears, 
as does agriculture (There is, of course, a separate SA heading for rural 
sociology.) 
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To uncover the geography of the WO literature, one must move beyond 
marginals to cooccurrence. To do this, I have analyzed PAIRS of topics. For 
example, the most common pairing of topics in this data set, not surprisingly, 
is gender and wages, a combination considered in 25 articles. (Other topics 
can of course be considered in these articles as well.) Eighteen articles consider 
gender and inequality in general, while 17 consider both general inequality 
and wages in particular. I have considered all pairings of topics that appear 
in more than five articles, an arbitrary but useful cutoff. There are 52 such 
pairings. 

That the network of topics includes 39 topics for these 52 connections tells 
us that the literature is not organized as a small group of central topics 
considered again and again in varying combinations. Quite the contrary, the 
pattern is of three or four principal topic areas largely disconnected from each 
other, each with its own hinterland of specialty topics. 

As we have seen above, the two principal topic areas are gender and 
unions/IR. Astonishingly, fewer than six articles consider gender in the context 
of unions or industrial relations. Yet the relation of gender and work was 
centrally structured by unions, as a brilliant historical literature on the “family 
wage” has shown. Our ahistorical, individualistic research has forgotten that 
gender 1s more than being female in a regression equation. 

Six topics are directly connected to gender and to nothing else. (I give 1n 
parentheses the number of joint appearances 1n articles.) They are: careers 
(8), career choice (9), family roles (14), stereotypes (6), sexual discrimination 
(10), and lawyers (7). It 1s surprising that there aren't more internal 
connections, since, with the exception of lawyers, these topics seem all of a 
piece. In addition, three topics are connected to gender and to one or more 
other topics that have no further connections: job satisfaction (8 to gender, 6 
to nursing), academic employment (6 to gender, 9 to university faculty), and 
unemployment (6 to gender, 8 to age, 6 to labor markets). It is clear from 
this pattern that sociologists treat the issue of "stages of work experience" 
basically as a gender issue, not as a general one. With the legal retirement 
limit rising, actual retirement age falling, and sweeping layoffs occasioned 
by the recession and international restructuring, one would expect a more 
general interest in career contingencies than merely in the problem of family 
and work (Doeringer 1990). 

As I have already noted, gender is the biggest member of a loose clique 
of topics—gender, wages, general inequality, race, and individual status 
mobility—that dominates this "side" of the literature. Off to one side of this 
clique is a less important core topic, general theory, connected to gender by 
nine cómmon appearances and via the topic of professions, which has six 
joint appearances with each. Theory has four topics connected only to itself: 
capitalism (8 ties), professionalism (6 ties), intellectuals (6 ties), and cultural 
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images (7 ties). In this second cluster, then, are the principal theoretical 
literatures on work, the Marxist literature with its focus on capitalism, the 
professions literature, and the newer, somewhat chaotic literature on cultural 
images and work. 

The other principal nexus of topics concerns industrial and labor relations. 
The topic of unions is connected to IR by 14 direct ties, as well as via 
manufacturing employment (8 ties to unions, 6 ties to IR) and via collective 
bargaining (11 ties to unions, 6 ties to IR). This is a more clique-like structure; 
many articles consider several of these topics together. The identity of the 
union literature with manufacturing is curious, since unionizatian in that sector 
has plunged in the United States, to be replaced by public sector unionism 
(AFSCME and NEA) that is relatively more female. Again the lack of a 
gender connection is surprising. 

The union topic area lacks the “hinterland topics” that gender has. It has 
only one such tie, to social control of work (6 common appearances). 
However, social control of work in turn relates to worker health (6 ties) and 
to doctors (9 ties), which in turn has 6 ties to ethics. The control theme thus 
has two faces: one involved with labor/management relations, the other with 
work practices in particular occupations. (For want of space, I do not review 
this control literature below.) 

Between unions/IR and gender lie a number of broker topics. That is, 
articles appear about unions and these topics, as about gender and these topics, 
but not about all three together. One set of brokers are general work structures; 
labor (8 ties to gender, 7 ties to unions) and management (6 ties to gender, 
6 ties to unions, 6 ties to IR). The two are tied to each other 7 times, and 
labor is not surprisingly tied to unions via the topic of organization, which 
has 6 ties with each. Again, the principal conclusion here is a negative one. 
One literature sees an important articulation between gender and labor and 
management while another studies the obvious connection betveen labor and 
management and the formal structuring of IR and unions. But the two 
literatures don't communicate. 

The other principal bridge between the gender topic area and the unions/IR 
one is brokered by race and by the "other" category of specific occupations. 
Again, this means that gender 1s often considered within the context of a 
single occupation (11 ties), as is unionism (7 ties), but the two are not 
considered simultaneously. Similarly, race and gender (14 ties) are considered 
together, as are unions and race (8 ties). Further, race is connected via its 10 
ties with general inequality to the tight little grouping of gender, wages, and 
general inequality that is the heart of the gender literature on work. There is 
also a separate tie between race and gender via individual status mobility (7 
ties to race, 11 ties to gender). 

In addition to showing the major holes in the literature, this network 
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structure identifies the major discrete areas that require survey here. I review 
the literature’s actual findings under these headings, using merely exemplary 
titles. There is not space for everything relevant. 


SPECIAL TOPICS 


Gender and Career Contingencies 


Gender’s influence on job satisfaction seems peculiar. Women are more 
satisfied than “objective” criteria suggest they ought to be, but several studies 
show that this does not reflect use of different job-rating criteria. The 
difference seems to lie in choice of reference groups, women often comparing 
themselves only to other women (Loscocco 1990, Loscocco & Spitze 1991, 
Hodson 1989). 

The gender and career/career-choice literature focuses mainly on the 
sex-segregation of occupations and on the behavior of women in traditionally 
male occupations. Reskin & Roos (1990) attribute segregation to the inter- 
section of employer ratings of potential employees with worker ratings of 
prospective jobs. This is actually a general theory of labor markets treated 
only in its gender connection, a limitation some writers have also noted in 
the comparable worth literature. Bradley (1989) also covers this area, as does 
a special issue of Work and Occupations (18:4, 1992). The literature on 
women ın traditionally male occupations focuses on the (to the authors) 
surprising similarity of these and other working women, particularly in reasons 
for occupational choice, working, etc. Authors are also surprised at the 
persistent strength of SES effects and economic need on occupational 
aspirations and choice (Timmins & Hainesworth 1989, Poole et al 1990, 
Padavic 1991). 

Another small literature considers work discrimination and harrassment at 
work. Considering the political importance of the topic, the lack of research 
is surprising. There are a few ethnographies of harrassment at work (Yount 
1991, Gruber 1989), and a nice history of overt discrimination in compensa- 
tion programs (Kim 1989). A number of historical works (like Lowe 1987 
and the Reskin & Roos 1990 case studies) consider the mechanics of deliberate 
feminization of occupations. 

A justly large literature considers the relation of family and work. A variety 
of articles consider usual topics like the need of complex compromises, the 
employer preference for single women (Peterson 1989), and the role of 
reconceptions in shaping perceptions of working mothers (Etaugh & Nekolny 
1990). Several books emphasize the importance of complex contingencies in 
shaping the articulation of family and work (Stichter & Parpart 1990, Beach 
1989, Selby et al 1990). Many also emphasize the enormous influence of 
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differing cultural traditions on this mediation (Stichter & Parpart 1990, 
Standing 1991, Ramu 1989). Several of these (e.g. Ramu 1989) show the 
enormous persistence of traditional attitudes in women undertaking modern 
work. Overall, this literature shows that analyses based on the half-century 
transformation of American mothers into wage workers cannot produce a 
general model for the relation of family and extra-family divisions of labor. 
Not only are there many other ways of working this relation cut, but those 
other ways point out things earlier analysts missed in the American experience. 
One cannot discuss this issue without a full analysis of work and family 
culture, of complexes of structural forces that impinge on the local work 
world, and of the creative response of family members—men and women 
alike—to them. Serious readers of Tilly & Scott will recognize this as an old 
lesson. But parts of the literature continue as if they hadn’t learned it. 


Gender and Specific Occupations/Areas of Employment 


Articles about gender and particular occupations are of two kinds. One kind 
compares men and women, generally finding the expected ditferences—in 
rewards (persisting after controls) and attitudes. Ott (1989) finds differences 
in the rewards of men and women under different kinds of gender mixes. 
Martin (1990) finds differences 1n the attitudes of men and women judges. 
Roach (1990) notes differences in achievement by men and women in law. 
Yoder et al (1989) note differences in academic hiring consequent on 
departmental gender mix. The second kind of article compares women to 
other women and finds extensive variety among them. Gray (1989) finds 
differences in nurses' organizational commitment reflecting feminist ideology 
and presence of children. Rosenberg et al (1990) distinguish several types of 
women lawyers, while Briscoe (1989) notes varying attitudes of women 
lawyers toward entering politics. Henry (1990) 1s worried by the antifeminist 
scripts of senior women academics. In the dozens of articles here sampled, 
these two strategies are never conducted concurrently, so that varieties of 
women are compared to varieties of men. This is clearly a major problem. 

In books on individual occupations, many of these themes recur. Kingsolver 
(1989), Kesselman (1990), and Fine (1990) all testify to the complex responses 
of women in particular occupational settings. Most important, all three testify 
to the centrality of forming a consciousness (both personal and corporate), 
making a set of symbols, and coming to an appraisal of the situation. Here, 
too, we find diversity of women's responses (as in Abir-Am & Outram 1987 
on women scientists). 

The antinomy between the simple male/female opposition and the diversity 
of women when considered alone is the basic conundrum of this literature, 
indeed, of the gender and work literature as a whole. The notion of women's 
oppression is so central to most writers in these literatures that they are 
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bewildered or angered when their subjects reject it (as in the elitism of some 
of Abir-Am & Outram’s scientists or the acceptance of clerical subordination 
in Fine). Yet at the same time writers see enormous variety in the responses 
of women to situations, a variety that probably—few have studied this question 
sympathetically—occurs similarly among men. A reminder that other solidar- 
ities can supersede gender comes from Hine’s (1989) book on black nurses. 
Like the literature on gender and career, then, the literature on gender and 
occupations is vibrant and exciting, if occasionally one-sided. The best of it 
shows conclusively how the structures of culture, tradition, economy, class, 
and social organization create frameworks within which women (and some- 
times men) transform their world of work, indeed their whole lives, and, 
ultimately, those structures themselves. This literature proves that the static 
approach of the individualist statistical studies can tell us little or nothing 
about why the relation of gender and career, family and work, looks the way 
that it does. 


Gender, Race, Wages, Inequality, and Status Mobility 


One subtopic commands a substantial portion of this literature: comparable 
worth (CW). Many of the writers on CW are not sociologists; CW is an 
applied topic most often researched in business schools, by professors of 
organizational behavior and psychology (only one author in the 1989 special 
issue of Journal of Social Issues on CW is a sociologist). Most articles on 
CW are frankly prescriptive. Their politics is well analyzed by Hegtvedt 
(1989), who also locates the CW literature within the theoretical literature on 
fairness. Some of the empirical work notes, again, the disturbing toleration 
of wage gaps by women (Major 1989), while others have noted that evidence 
for direct bias in job evaluation is fairly rare (Mount & Ellis 1989) although 
historical factors (Kim 1989) and knowledge of existing wage differences 
(Mount & Ellis 1989) provide avenues for indirect bias. By far the best studies 
of CW are the three book length case-studies (Acker 1989, Blum 1991, and 
Evans & Nelson 1989), which provide cautionary tales about the complexities 
of actual politics, the difficulties between elite feminist reformers and 
workaday women, the complex cross-cutting of gender and class interests, 
and (here at last) the intricate relations between CW advocates and the unions. 
These books are all thought-provoking analyses. 

Beyond the literature on CW, the articles on gender, race, and various 
forms of inequality are a cacaphony. Their time frames run from 50 years to 
20 years to 5 years to none. Their spatial units run from countries to industries 
to states to firms to jobs within firms. We can draw no general conclusions 
from combining these articles because we have no theory saying how these 
different levels and periods of social reality relate to one another. Fillmore’s 
(1990) 50-year study of Canadian census documents may support a reserve 
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army theory of women’s labor force participation, but how do we reconcile 
that with Rosenfeld & Kalleberg’s (1990) equally interesting cross-sectional 
finding that countries with more occupational sex-segregation can have more 
equal income distributions (Sweden)? We have state-level census data showing 
that unionization lowers wage inequality, panel survey data showing that 
gender gaps have worsened, firm-level data showing blacks and women do 
badly, occupational data showing that although selection bias lowers the 
effect, women in female-dominated occupations still make 6 to 15% less than 
equivalent women in other occupations, and so on and so on. The only 
common theme of these results is that women always do somewhat worse 
than men. Politically, such findings are useful; intellectually, they are old 
news. Without a theory linking large-scale change to local structures and local 
structures to individuals, we cannot synthesize such findings. There is obvious 
concern that classical Marxism, which used to provide a general theory for 
some people, just doesn’t work. 


Gender and Labor/Management 


There is a very small literature on gender and management, mostly on women 
managers. Thus, Chusmir & Mills (1989) note differences in conflict 
resolution styles but feel that job level makes more difference than gender. 
Wiley & Eskilson (1990) show the influence prospective images are likely to 
have on gender management styles. There is also a literature on management 
careers and gender, which has much the shape suggested above. There are 
also a few books on gender and labor. Gabin’s (1990) excellent history of 
women and the United Auto Workers gives a detailed, practical portrait of 
women’s successes and failures on the inside, as do Kingsolver’s (1989) 
wonderful book on women in a mining strike and a number of the books on 
CW. This area of the relation of women to the principal corporate structures 
of the work world is understudied in WO, although there is some writing on 
it in the management literature proper. 


Unemployment vis a vis Gender, Age, and Labor Markets 
Generally 


The literature on unemployment (UE) is extensive and cosmopolitan, the latter 
fact reflecting the larger size and greater political importance of UE in Europe. 
The literature is chiefly focused on age and labor market policies. There is 
some question whether increased female labor force participation has increased 
UE generally, with mixed evidence (McCarthy 1988, Furaker et al 1990). 
DeBoer & Seeborg (1989) attribute the sharp convergence of male and female 
UE rates partly to changing labor force attachment and partly to the interaction 
of occupational sex segregation with sectoral differences in UE. 

A large literature considers UE and age, focusing on the beginning and 
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ending of the work life course. There is increasing attention to the fuzziness 
of entry into and departure from the labor force. Hartley (1989) considers the 
impact of youth wage rates on labor force entry; Singell & Lillydahl (1989) 
consider various definitions for youth UE; Sullivan (1989) considers the 
interaction of crime and work in youth; and Weis (1990) examines a high 
school’s responses to deindustrialization. An even larger literature considers 
retirement, now recognized to proceed by fits and starts. (Weiller 1989 gives 
a historical background.) Some consider the response of older workers to 
career displacements (Rife & First 1989), while others work at disentangling 
early retirement from UE (Casey & Laczko 1989), itself complicated by older 
workers’ interpretation of their segregation into special labor markets (House 
1989). Schulz et al (1991) and Szeman (1989) study the complex interactions 
of policy in this area, the latter being an elegant study of the emergence of 
UE in a pensioners’ labor market. 

Labor market policy provides the other major theme in this area. A number 
of writers emphasize the wide diversity of policies (Evers & Wintersberger 
1990, Grahl & Teague 1989) while noting that different policies may yet 
produce similar UE rates. A surprising number of works refer, directly or 
indirectly, to the cultural construction of unemployment, by workers and 
policy makers. This construction can be analyzed at the national level (Janoski 
1990), within a particular policy (Miller 1991), or among individual workers 
(Pappas 1989, Mandler 1990. See also Salais et al 1986). Work on UE is 
beginning to be complemented by sophisticated work on structures of 
employment itself. Korver (1990) and Jacoby (1991) provide general histories 
of forms of American employment, while DiPrete (1989) gives a detailed 
analysis of the Federal civil service. 


Unions and Labor Relations 


The literatures clustered around the union and IR topics are much more 
international than the literatures around gender. This reflects not only the 
currently declining fortunes of American unions, but also the greater institu- 
tionalization and political importance of both organized labor and labor 
policies in other advanced countries. 

There are many case studies. Bracho (1990) examines the demise of a 
Mexican cadre union at the hands of peasant and worker confederations. 
Berger (1990) studies gender conflicts in a South African canning union, 
noting the interaction of anti-apartheid politics and union politics. Barber 
(1990) studies a fish processing plant union, emphasizing that labor action is 
possible only when larger structural circumstances interact with local cultural 
values to provide opportunities for locally meaningful action. The emphasis 
on interaction of micro and macro reaches its peak in Cornfield's (1990) 
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massive study of the furniture workers' union, where status conflicts generated 
by external pressures drive union change. 

A number of studies consider broader issues like the emergence of overall 
styles of IR. Patmore (1988) tells how arbitration procedures in Australian 
railroading created openings for union activity, while Kelly (1988) considers 
the much later emergence of tripartite structures in Australian IR. Hertle & 
Kadtler (1990) see the move of German paper and ceramics unions toward 
industry-wide policy as key to union success. Teage (1989) considers attitudes 
of European workers toward transnational unions possible under the EC 
(attitudes are mixed.) Here too, the theme of culture makes an appearance. 
Cohen (1990) attributes much of the peculiarity of American labor relations 
to the influx of British workers into a labor system that had had no phase of 
large-scale domestic production, although Haydu (1989) finds that effects of 
changes in production patterns swamp those of cultural differences in his study 
of open shops early in the twentieth century. 

The literature on collective bargaining proper is another of those relatively 
complex literatures whose chief implication is that complex conjunctures are 
central. Miller & Canak's (1991) case study of public sector collective 
bargaining in Florida provides a good example, explaining why local 
conditions undermined the "normally expected" effects of the "usual vari- 
ables." Public sector collective bargaining groups are equally a focus for Troy 
(1990), who wants to explain their strength, and Gagnon (1990), who wants 
to explain their divisions. There are the usual attempts to explain the 
dominance of bread and butter issues for American unions (e.g. Taplin 1990) 
when contrasted with broader, neocorporatist bargaining in Europe. 

More general work continues the same themes. Clark's (1989) study of 
American union decline in the 1980s again focuses on a conjunction, in this 
case of internal tensions, economic restructuring, and conscious anti-union 
policy. Baglioni & Crouch's (1990) review of European IR sees less 
convergence of forces and consequently more various outcomes, although 
noting the general shift in industrial relations toward management initiative 
and dominance. 

Also repeated in this broader literature is the emphasis on the culture, the 
symbolic and subjective representation of work. Sometimes, this involves 
direct study of worker attitudes to work itself, as in Lincoln & Kalleberg's 
(1990) quantitative attempt to distinguish Japanese & American attitudes 
toward work. In a different style, Vallas (1991) opposes standard hegemony 
theories of worker attitudes toward management, while Bodnar (1989) turns 
to detailed oral histories to reconstruct collective constructions of life in an 
automobile plant. A variety of work considers political attitudes of workers 
and managers, the adversarial attitude often being singled out (particularly in 
applied literatures) as "problematic." [See Peck & Hollub (1989) for an 
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empirical study of this situation, and Lambelet & Hainard (1989) for a 
theoretical one.] Survey studies of workers document precisely that Michelsian 
view of union leaders (Golden 1990) whose error Cornfield (1990) was at 
pains to show in his book. 

Study continues of worker participation; essays in Sirianni (1987) provide 
a useful cross-section of these. Grootings et al (1989) also review the area. 
À more analytical attempt is by Tsiganou (1991), who places all the new 
participative schemes in a common and explicit comparative framework. A 
few writers (e.g. Wever 1989) have begun to examine the specific conditions 
for participation schemes with standard methods. Results seem to suggest as 
crucial the conjunction of union security, union economic leverage, and some 
common vision between labor and management. The absence of these may 
explain the concern, here and there in the literature, that the current wave of 
worker participation schemes is over. 


Unions and Race 


The exciting and angry literature on unions and race is largely within the 
discipline of history. It is today making a transition that the historical literature 
on labor and gender made more than a decade ago: a transition in which the 
“good guys" of the new labor historians are recognized to have partaken not 
only of the best but also of the worst of their cultures. Asher & Stephenson 
(1990) provide a collection looking on the positive side of race relations within 
the labor movement. Hirsch's (1990) immensely detailed study of Chicago 
labor focuses on culturally rooted ideologies and gives a more complex, and 
on balance more negative, picture. There has been frank debate about racism 
in the unions, particularly in the UAW, sparked by the work of Hill (1987, 
1989), Responses to Hill's analysis range from mild apologetics through 
agonized fence-sitting to angry denial. The reader may choose his or her own 
version. (See various writers in New Politics 1:3, 1987, and International 
Journal of Politics, Culture and Society 2:3, 1989.) 


Work and Culture 


The area of work and culture has several parts. The first concerns ethnicity. 
Most study of ethnicity and work is done in books, perhaps because what we 
often take as a simple variable breaks down, on serious examination, into a 
diverse bundle of phenomena. Studies here include Jesuadson's (1989) 
examination of the interaction of state, ethnic groups, and external forces in 
Malaysia, Bourgois's (1989) study of the diversity of ethnic groups on a 
banana plantation, and Peled's (1989) analysis of class and ethnicity among 
Jewish workers of the Russian Pale. As is usual in books, there is emphasis 
on complexity and contingency, although all three take environing economic 
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forces as exogenous and all emphasize (especially Peled) the construction of 
ethnicity in the face of work interactions. A somewhat related literature takes 
up the impact of national level cultures on transnational phenomena like 
capitalism, a topic reviewed by Clegg & Redding (1990). (See also Gullick 
1990 on expatriate managers.) There is much writing, now more now less 
applied, on "organizational culture" (e.g. Paules 1990, D'Amico-Samuels 
1990). 

By far the most exciting study of culture and work concerns the imagining 
of work itself. Some of this study has concerned occupations (e.g. Symonds 
1991, Whalley 1987). But the most exciting work concerns occupational 
statistics. A handful of people—Margo Anderson (Conk 1980), Simon 
Szreter (1984, 1992), and others—have definitively shown that we cannot 
take census occupational categories as immediately meaningful, either 
synchronically, as in the DOT, or diachronically, as in the quixotic effort 
of the Minneapolis 1880 Census Project to render “comparable” all US 
occupational classifications back to 1880. In France, official statisticians 
themselves lead the way in this reanalysis of occupational categories; Alain 
Desrosiéres of INSEE invented the 1982 occupational classification, and he 
himself (Desrosiéres & Thévenot 1988) has written the analysis deconstruct- 
ing it. Robert Salais and others (1986) have written an equally brilliant 
book on the concept of unemployment. [Work on this area has appeared 
in such diverse places as Finland (Kinnunen 1988) and Portugal (Ravara 
1988), work I am sadly unable to read.] A central driving force in this 
reappraisal of occupational categories has been recognition of their facile 
occlusion of women’s work, a subject investigated at length by Higgs (1987), 
Folbre & Abel (1989), and others. Beechey (1989) has recently reopened 
the topic with a focus on its implications for cross-national comparisons of 
womens’ work. It is absolutely certain that analysis of the culture of work 
will be one of WO’s major future topics. 


Professions 


Professions continue to be the most effectively theorized of occupations. To 
be sure, a few individual level studies are found (Johnson 1990 and Jolly et 
al 1990), particularly on gender matters. And the steady flow of case studies 
continues (Crawford 1989, Curry 1990, Galanter & Palay 1991, Landon 1990, 
Stebbins 1990), as does comparative study, at least of lawyers (see the three 
volumes of Abel & Lewis 1989). But theonzing dominates. After nearly two 
decades under the "professional dominance" paradigm, sociologists of med- 
icine finally recognized that medicine's precarious position demanded more 
complex theories (see the special issue of Milbank Quarterly 66:Supplement 
2, 1988). A special issue of Sociologie et Sociétés (20:2, 1988) shows far 


204 ABBOTT 


more bravura. Catherine Paradeise (1988) construes professions in terms of 
labor markets, while Magali Sarfatti Larson (1988) turns to Foucauldian fields 
of discourse and Elliott Krause (1988) to relations between state and 
profession. Of these three lines of attack, Krause’s appears to be most 
prominent today, partly because of policy debates on professionals within the 
EC. (See also essays in Torstendahl & Burrage 1990 and Burrage & 
Torstendahl 1990). We can also expect more studies of gender and profes- 
sions, although the mechanics of women's exclusion and of the relations 
between male and female professions are pretty well-studied. As yet unstudied 
in any real detail is the gendering of the concept of expert itself in the 
nineteenth century. 

Abbott (1988) attempted to recast the area with three basic arguments: that 
professions could not be studied individually but only within an interacting 
system, that a theory of professions had to embrace not only culture and social 
structure but also intra-, inter-, and transprofessional forces, and that the 
development of professions would necessarily be a matter of complex 
conjunctures. None of these arguments has had much impact. The jurisdic- 
tional studies Abbott called for have not appeared; linear studies of individual 
professions continue to dominate (e.g. Wenocur & Reisch 1989, Junqueira 
Botelho 1990, Brain 1991). Moreover, recent studies have emphasized either 
the cultural (Brain & Larson) or the social structural (like most work on 
relations with the state), and none systematically pursues multilevel analysis. 
Finally, although ridiculed by Abbott, the search for "determining variables" 
continues (e.g. Raelin 1989). 


Theory 


Capital is surely the archetypical examination of work: a study ranging from 
micro to macro, from social structure to (once in a while) culture, systemat- 
ically drawing together a theory of economy, of work, of organization, and 
of association and placing the whole within a formal conception of the 
historical process. It is therefore not surprising that most synthetic studies of 
work are Marxist in lineage if not in fact. Yet much of what seems general 
theoretical work ıs simply macro rather than synthetic. Thus we have 
literatures on macro topics like classes (Zeitlin 1989, Swenson 1989, Laba 
1991), organizations (Fligstein 1990, Chandler 1990), and technology (Wright 
et al 1987, Ling 1990, Nash 1989, Morgan & Sayer 1988). There are also 
various critiques of economic reasoning (Block 1990, Fried] & Robertson 
1990). But true multilevel work is mainly within case studies, rather than in 
the theoretical literature itself. Jn particular, the problem of transnational 
capitalism's impact on individual localities has drawn some interesting 
multilevel work. 
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CONCLUSION 


The theoretical problem of reconciling the micro and the macro is general in 
sociology and crucial in WO, as Kalleberg (1989) recognizes. But our official 
theorists do little more than relabel the problem with new words like 
structuration. A serious theory of micro to macro can emerge only from an 
empirical area. In WO, it will emerge from people working with multilevel 
data on work: data that brings together exact career information (micro), 
network structure among careers and jobs (meso), and occupational/organi- 
zation level information (macro) on occupations and work structures in conflict 
and process. At present, only historians and historical sociologists have taken 
the time to gather such datasets, and usually only on professions or skilled 
laborers. In the meantime, lack of theory covering multiple levels in both 
space and time makes most pieces of quantitative research on work mutually 
irreconcilable and hence meaningless. It remains true that for all our modern 
insights, there is only one truly general theory of work, that of Marx. 

Yet all the formal attractions of Marx's work cannot hide its present 
problems—its assumptions about the nature of work (see Seidman 1991) as 
much as its magnificent failures as a guide for policy. And ironically, although 
the formal beauty of Marx's theories means little to the individualist literature 
on work, in the hands of many current mainstream workers the spiritual values 
of his theory—the union of theory and practice and the hatred of oppression— 
have become that literature's chief cornerstone. For the last decade or more, 
the best study of work has without question come from researchers whose 
passion to right wrongs spoke in every sentence. In my view, that commitment 
is beginning to obscure our understanding. That studies attend to diversity 
among women only when men are completely off-stage is one indicator, as 
is the common treatment of men as a unified, homogenous, oppressive group. 
Another is the insistence that "there is no sorrow like my sorrow,” for example, 
the peculiar focus on comparable worth across gender to the exclusion of 
other differences (q.v. Mount & Ellis 1989, Hine 1989.) Still another is the 
common anger at research subjects, both past and present, who refuse to 
recognize what we now "know" to be their oppression. More fools they. 

The move of historical studies of gender and work toward explicit 
recognition of the complex, contingent responses of men and women to 
changes in their world is a move in the right direction. Like Marxist study of 
Work before it, feminist history of work has been led by its focus on lived 
experience to recognize the complex contradictions of life that ideologies often 
ignore. But there is a long way to go. At the deepest level, those focused on 
oppression have to confront anti-hierarchicalism for the culture-bound, 
Western ideology that it is, perhaps by frankly confronting what kinds of 
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hierarchy they might consider just. This problem may be under discussion 
off-stage, in theoretical writing I don’t know. But here in the area of WO, 
our day-to-day experience is that most of the really exciting work—the work 
with new data, new questions, and new solutions—has a consistently 
politicized tone, while most of the apolitical work is intellectually sterile. We 


die between. 
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Abstract 


Neither nationalism nor ethnicity is vanishing as part of an obsolete traditional 
order. Both are part of a modern set of categorical identities invoked by elites 
and other participants in political and social struggles. These categorical 
identities also shape everyday life, offering both tools for grasping pre-existing 
homogeneity and difference and for constructing specific versions of such 
identities. While it is impossible to dissociate nationalism entirely from 
ethnicity, it is equally impossible to explain it simply as a continuation of 
ethnicity or a simple reflection of common history or language. Numerous 
dimensions of modern social and cultural change, notably state building (along 
with war and colonialism), individualism, and the integration of large-scale 
webs of indirect relationships also serve to make both nationalism and ethnicity 
salient. Nationalism, in particular, remains the pre-eminent rhetoric for 
attempts to demarcate political communities, claim rights of self-determination 
and legitimate rule by reference to “the people” of a country. Ethnic 
solidarities and identities are claimed most often where groups do not seek 
“national” autonomy but rather a recognition internal to or cross-cutting 
national or state boundaries. The possibility of a closer link to nationalism is 
seldom altogether absent from such ethnic claims, however, and the two sorts 
of categorical identities are often invoked in similar ways. 


Introduction 


One of the uglier ways in which nationalism gained popular and academic 
attention in the early 1990s was the Serbian program of “ethnic cleansing.” 
When promulgated by a psychiatrist, and other academically trained rep- 
resentatives of modern science, this policy has helped to demonstrate that the 
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nationalist upheavals and ethnic violence that followed the collapse of 
Soviet-style communism were not simply throwbacks to some premodern 
reign of passion, sentiment, and primordial identity. The policy of “ethnic 
cleansing,” like all of nationalism and ethnic politics, depended on social 
constructions of identity, mobilized members of the chosen ethnic group only 
unevenly, and served the interests of some participants far more than others. 
It forced many Serbs who had previously allied themselves with the vision 
of a multiethnic, democratic Bosnia-Hercegovina to resort to ethnic solidarities 
in the face of civil war. Claiming these ethnic solidarities and the identity of 
Serbs as both ancient and seemingly “natural,” the new ideological mobili- 
zation successfully demanded that its adherents be willing both to kill and to 
die for their nation. 

If there were any doubt about the importance of the claimed link between 
ethnicity and national self-determination, the fighting in what was once 
Yugoslavia should have dispelled it. The Yugoslav conflicts, moreover, 
stemmed in part from the very nationalities policy employed by the country’s 
former communist government, both recognizing subordinate nationalities and 
ethnic groups and drawing state lines that intentionally cross-cut ethnic and 
national residential patterns (Connor 1984, Banac 1984). Neither ethnic 
conflicts, nor the discourse of national identity, nor the practical power of 
nationalist mobilizations has receded into the premodern past despite the 
confidence of many earlier social scientists (an embarrassment especially for 
marxists: see Schwartzmantel 1991, Nairn 1975, 1977, Debray 1977). At the 
same time, the idea of the nation remains central to most attempts to define 
legitimate political communities (Brubaker 1992, Harris 1990, Mayall 1990, 
Noiriel 1991b). A central theme in this discourse is the question of the extent 
to which nationalism should be understood as a continuation of long-standing 
patterns of ethnicity, or as something distinctively new and modern. This is 
the focus of the present review.! 


The Modernity of Nationalism 


The discourse of nationalism is distinctively modern. It is variously argued 
to have originated in the seventeenth century British rebellion against 
monarchy (Kohn 1944, Greenfeld 1991, 1992), the eighteenth century 
struggles of New World elites against Iberian colonialism (Anderson 1991), 
the French revolution of 1789 (Alter 1989, Best 1988), and the German 
reaction to that revolution and to German disunity (Kedourie 1960, Breuilly 
1982). But as Best (1982: 29) puts it: “Historians of nationalism agree to 


"There are innumerable other dimensions to the broad literature on nationalism and ethnicity 
which are not covered here The best general reviews are Smith's (1973, 1981), see also Carter 
(1981), Deutsch (1970), Haas (1986) and Noinel (1991b). 
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differ in their estimates of how much of it (and what sorts of it) already existed 
in the Atlantic world of 1785. They are at one in recognizing that that world 
by 1815 was full of it, and that although each national variety had of course 
its strong characteristics, those varieties had enough in common for it to 
constitute the most momentous phenomenon of modern history.” In the early 
modern era the idea of nation as an aggregate of people linked by co-residence 
or common sociocultural characteristics took political and cultural connota- 
tions in struggles with and between states and over state-building. This led 
to the distinctively modern invocation of nationalism as “a theory of political 
legitimacy, which requires that ethnic boundaries should not cut across 
political ones, and, in particular, that ethnic boundaries within a given state—a 
contingency already excluded by the principle in its general formulation— 
should not separate the power-holders from the rest" (Gellner 1983: 1). As 
Kedourie (1960: 9) summed up a generation before, the discourse of 
nationalism ideal-typically offers three propositions: "that humanity is natu- 
rally divided into nations, that nations are known by certain characteristics 
which can be ascertained, and that the only legitimate type of government is 
national self-government." 

Nationalism has become the preeminent discursive form for modern claims 
to political autonomy and self-determination. The term was apparently coined 
in German by the philosopher Herder (Berlin 1976: 181) and in French by 
the Abbe Barruel (O'Brien 1988: 18) just less than 200 years ago. It was 
linked to the concept of nation-state in the notorious formulations of Woodrow 
Wilson and the League of Nations (Mayall 1990: 44—45, Kohn 1962: 133—35). 

In the wake of communism's collapse, nationalism and ethnic conflict 
appeared as the primary issues in the realignment of Eastern European politics 
and identity (Chirot 1991, Tilly & Walker 1991). Indeed, in many instances 
communist governments had been actively involved in nationalist mobilization 
themselves, in varying degrees cynically and idealistically (Verdury 1991, 
Connor 1984). Appeals to the idea of nation organize movements of ethnic 
separatism from Quebec (Birch 1989, ch. 8; Fenwick 1981, Letourneau 1989, 
Little 1989, Taylor 1992) to the postcolonial states of Africa (Davidson 1992, 
Lewis 1983, Markakis 1987, Mazrui & Tidy 1984, Nzongola-Ntalaja 1987, 
Selassie 1980). Nationalism is equally prominent in movements to integrate 
disparate polities, as in twentieth century Arab nationalism (Anderson et al 
1991, Farah 1987, Tibi 1990) and nineteenth century German nationalism 
before it (Coetzee 1990, Eley 1980, 1992, Meinecke 1970, Mosse 1975, 
Sheehan 1978). New nationalisms proliferate throughout the developed West 
(Tiryakian et al 1985, Smith 1981), and attempts are made to decolonize the 
discourse of nationalism in the Third World and claim it for indigenous 
movements and meanings (Blaut 1987, Chatterjee 1986). In East Asia, 
nationalism has throughout the twentieth century been the rhetoric not only 
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of anti-imperialist struggles but of calls for strengthening and democratizing 
states from within (Chow 1960, Spence 1981, Schwarcz 1986, Wells 1991, 
White et al 1990). Nationalism is anything but a thing of the past, thus, and 
even the newest claims to nationalism are often rooted in a rhetoric of 
pre-existing ethnicity. 


Nationalism as Discourse 


Yet, despite this agreement about the contemporary salience of the discourse 
of nationalism, Hobsbawm (1990: 14) makes a sharply contentious assertion 
when he writes “the basic characteristic of the modern nation and everything 
connected with it is its modernity.” Even the repetition of the term modern 
in both subject and predicate of his sentence does not save it from controversy, 
for Hobsbawm is arguing against a widespread view of both academics and 
nationalists themselves. This is the view that modern nations are based on 
ethnic identities that are in some sense ancient, primordial, possibly even 
natural or at least prior to any particular political mobilization. A great deal 
is at stake in this argument. Most crucially, can “nationhood” be taken as the 
prior basis for nationalist claims? Is self-determination, for example, a political 
right to be accorded all “true” nations, as the apostles of nationalism asserted 
in the mid-nineteenth century “Springtime of the Peoples” (Kohn 1962, 
Meinicke 1970, Cohler 1970)? Are Serbs intrinsically a nation, to revert to 
our opening example, such that any claims of multiethnic Bosnia-Hercegovina 
to include large Serbian populations are infringements on the rights of the 
Serbian nation? Or, is “nation” at best a rhetorical mode of making political 
claims, and at worst a way for certain elites to manipulate mass sentiments 
in pursuit of power? In more academic terms, does the prior existence of 
ethnicity explain nationhood, and does nationhood explain nationalism? Or 
is the notion of membership in a common nation (and perhaps even in an 
ethnic group) a product of nationalist (or ethnic) mobilization? Is nationalism 
simply a derivative result of state-formation and other “material” aspects of 
modernization, or is it one of the primary constituents of modernity? 

This issue is hard to keep entirely clear in our minds because most variants 
of nationalist rhetoric claim the nation as an always-already existing basis for 
action, whether as the continuation of ancient ethnicity or as the result of 
historically specific acts of foundation. As moderns we are all participants in 
the discourse of nations whether we like it or not. Many of the categories and 
presumptions of this discourse are so deeply ingrained in our everyday 
language and our academic theories that it is virtually impossible to shed 
them, and we can only remind ourselves continuously to take them into 
account. A simple example is the assumption that “society” is a noun referring 
to self-sufficient units with clear boundaries. Tilly (1984: 11) makes this the 
first of his "eight Pernicious Postulates of twentieth-century social thought": 
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"Society" is a thing apart; the world as a whole divides into distinct "societies, 
each having its more or less autonomous culture, government, economy, and 
solidarity." 


This is a usage produced by the discourse and political salience of the 
modern idea of nation (and specifically its hyphenated conjunction with 
“state”). As Halle (1962: 25) put it, "perhaps the idea alone can give the 
community the singleness and integrity which we attribute to it when we think 
of it as a corporate person." In fact, societies have not always been and are 
not everywhere equally bounded, nor 1s it clear that they are as bounded in 
the archetypal cases of modern nation-states—e.g. France—as ordinary 
language (including ordinary sociological language) implies (Giddens 1984, 
Anderson 1991). Even island Britain manifests a complex history and present 
struggle over external as well as internal boundaries (Samuel, ed. 1989). 

Given the multiple and overlapping networks of our social relations (Mann 
1986 and forthcoming), and given the large scale international flows of our 
ideas, language, and cultural productions (Bhabha 1990), it should perhaps 
be a matter of principle to avoid using terms like society as though they 
referred to unitary, clearly demarcated objects. But this would be an extremely 
difficult principle to live up to. However sincere our intention to speak only 
of more or less consolidated patterns of social organization, more or less 
overlapping and densely integrated networks of social relations, more or less 
homogeneous cultural forms and contents, etc, we should soon be driven to 
speak both in proper nouns of Indians and Germans, Koreans and Kenyans, 
and in common nouns of societies or peoples. We live in a world-system 
which is organized into states and which thematizes certain cultural differences 
as constituting "cultures," while others are suppressed as unimportant internal 
or cross-cutting variations. This world-system makes both nationalism and 
claims to ethnic identity as problematic as they are imperative, even while it 
makes it hard to escape enough from the power of received categories to 
understand why they are problematic. 

This is one reason why "nationalism" and corollary terms like "nation" 
bave proved notoriously hard concepts to define (Alter 1989, Breuilly 1982, 
Connor 1978, Kemilainen 1964, Smith, 1973, 1983). The notion of nation is 
so deeply imbricated in modern politics as to be "essentially contested" (in 
Gallie's phrase), because any definition will legitimate some claims and 
delegitimate others. It also reflects more general problems with essentialist 
definitions (Fuss 1989: 2—6). Nation and nationalism are among those terms 
used to refer not to any clearly definable set, the members of which all share 
some common features which nonmembers lack, but rather to a cluster of 
"family resemblances” (in Wittgenstein's term). All of the available essen- 
tialist definitions are unstable and inherently contestable, thus, not only 
because they bias usage for or against various political claims, but because 
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they are based either (i) on qualities which putative nations or nationalist 
movements share with admitted non-nations (such as ethnicity), or (ii) on 
qualities which are not clearly shared among all recognized members of the 
set of nations (like control over or ambition to control a state). 

Though nationalisms are extremely varied phenomena, they are joined by 
common involvement in the modern discourse of nationalism. They are 
common objects of reference in international law, political debate, and even 
economic development programs. As Anderson (1991) has stressed, once the 
idea of imagining political communities as nations was developed, it was 
*modular" and could be transplanted into a wide range of otherwise disparate 
settings. This is what raises the issue of whether Third World or postcolonial 
nationalisms express "authentic" indigenous concerns or are in some sense 
derivative discourses (Chatterjee 1986). The discourse of nationalism is 
inherently international. Claims to nationhood are not just internal claims to 
social solidarity, common descent, or any other basis for constituting a 
political community. They are also claims to distinctiveness vis-à-vis other 
nations, claims to at least some level of autonomy and self-sufficiency, and 
claims to certain rights within a world-system of states (Seton-Watson 1977, 
Breuilly 1982, Mayall 1990). In other words, however varied the internal 
nature of nationalisms, 1n other words, they share a common external frame 
of reference. Thus, even if nationalist claims to primordial origins, ancient 
ethnic pedigrees, or hallowed founding histories were all true, thus, and even 
if every nation had premodern roots (something manifestly impossible in the 
case of such settler societies as the United States, Australia or South Africa—at 
least as defined by their European populations), nationalism would still be a 
modern phenomenon. This is true even of "extreme" forms such as National 
Socialism, despite the tendency of modernization theorists and others to treat 
Nazism as a throwback to the premodern (Talmon 1952, 1960, Bendix 1964) 
rather than a problem of modernity (Alter 1989, Herf 1984). Indeed, this 
phenomenon of claiming state-centered political rights on the basis of 
nationhood is arguably one of the defining phenomena of modernity. 


The Centrality of States 


Those who argue for the priority of nations over nationalism (Armstrong 1982, 
Marcu 1975, Smith 1986) seldom dispute the distinctiveness or centrality of 
modern states. They would follow Tilly's (1990: 2) summary, for example, 
in distinguishing empires, city-states, and other early formations from "states 
governing multiple contiguous regions and their cities by means of centralized, 
differentiated, and autonomous structures." Debate centers on whether nation- 
alism is a by-product of the creation of these states—and accordingly likely 
to disappear as they are transformed in the present era (Tilly 1992). As Tilly 
develops his argument about the distinctive character of modern states, he 
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stresses the consolidation of centralized administrative power, the develop- 
ment of capacities to mobilize otherwise civilian populations (and material 
resources such as industry) for interstate warfare, and the partitioning of the 
world into comparable states. These tendencies tie the politics and social 
organization of such states firmly to the modern era. Ambiguity arises only 
with regard to the role of culture, and more generally the claim of such states 
to be “national,” or of various “peoples” without states to deserve such 
“national” states as a matter of right. Tilly (1990: 3) suggests that we simply 
distinguish "national state" from “nation-state,” restricting the latter term to 
those states “whose people share a strong linguistic, religious, or symbolic 
identity.” National states (though Tilly does not define the term) appear to be 
those which attempt to extend direct rule to their entire populations and expand 
their capacity to organize the lives of the members of those populations, 
whether for purposes of warfare or economic development. They are 
“national” by virtue of their attempt to integrate large populations and 
territories, and by contrast mainly to city-states (that do not fully integrate 
their hinterlands) and empires (that do not attempt to integrate or closely 
monitor the everyday affairs of those they rule). 

As direct rule expanded throughout Europe, the welfare, culture, and daily 
routines of ordinary Europeans came to depend as never before an which state 
they happened to reside in. Internally, states undertook to impose national 
languages, national educational systems, national military service, and much 
more. Externally, they began to control movement across frontiers, to use 
tariffs and customs as instruments of economic policy, and to treat foreigners 
as distinctive kinds of people deserving limited rights and close surveillance. 
As states invested not only in war and public services but also in economic 
infrastructure, their economies came to have distinctive characteristics, which 
once again differentiated the experiences of living in adjacent states. To that 
degree, life homogenized within states and heterogenized among states (Tilly 
1990: 116; see also Watkins 1990 on intrastate homogenization of fertility 
patterns). 

State-building produced a basic discontinuity with earlier forms of social 
organization (Tilly 1975, P. Anderson 1974, Giddens 1984, Poggi 1992, 
Rokkan 1975). At the same time, capitalist economic development knit 
together large-scale markets and transformed the units of economic activity 
and interest (Wallerstein 1974, Balibar & Wallerstein 1991, Hechter 1975, 
Naim 1977, Worsley 1964/1986). Tilly shifts emphasis away from culture 
but does not break sharply with the developmental narrative of older 
modernization theory (e.g. Gellner 1964, Bendix 1964, Apter 1965, 
Eisenstadt 1966, 1973, Smelser 1968). This older modernization theory 
operated with a tension between its assumption of nation-states as basic units 
of modern political economy and its treatment of all "undesirable" forms of 
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nationalism as merely inherited from previous eras, a sort of survival that 
could be expected to wane or moderate into acceptable patriotism in the long 
run even if it contributed to short-term eruptions every now and again (Talmon 
1952, 1960, Parsons 1960). Modernization theory thus predicted that when 
outlying regions were incorporated into a social system they would gradually 
be “homogenized” into cultural similarity with the rest of the system, 
nationalism centered on the encompassing state would grow and contrary 
ethnic mobilization would be transitory. Researchers emphasizing capitalist 
economics more than state development often broke more sharply with 
modernization theory (Wallerstein 1974-1988). Thus Hechter (1975) at- 
tempted to show how ethnic mobilizations in Britain’s Celtic periphery were 
precisely the result of incorporation into British political economy, but 
incorporation in a disadvantaged position. Hechter’s account focuses primarily 
on how economic factors provoked ethnic mobilization; it offered much less 
account of why ethnic identity was salient. This led Smith (1983) to accuse 
Hechter of economic reductionism. The account of nationalism as a peripheral 
response to core expansion at best helps to explain levels of resentment and 
mobilization. It does not address the constitution of national identity or the 
modern conditions of its reproduction (but see Hechter 1987 and Hechter & 
Furtado 1992 for revised arguments). 

The more materialist and state-centered view, moreover, carries a strong 
tendency to see not only nationalism but nationhood as basically following 
from rather than shaping the rise of Buropean modernity. Nations are, in this 
view, produced by the rise of states (and/or the capitalist world-system). As 
Giddens (1984: 116) puts it: 


By a "nation" I refer to a collectivity existing within a clearly demarcated territory, 
which 1s subject to a unitary administration, reflexively monitored both by the 
internal state apparatus and those of other states. . A "nation", as I use the term 
here, only exists when a state has a unified administrative reach over the territory 
over which its sovereignty is claimed. 


In such a usage, the relationship between nationalism and ethnicity is more 
or less coincidental. It 1s the modern state that defines nationhood, and 
preexisting ethnic relations are revised either to coincide more or less with 
its boundaries or to constitute the basis of counter-state movements for the 
formation of new states. Such movements are rooted in power relations, not 
ethnic solidarities and distinctions per se. 

Giddens and especially Tilly associate cultural accounts of nationalism 
with explanations in terms of pre-existing ethnic solidarities and differences. 
Gellner (1983), by contrast, analyzes nationalism as a cultural phenomenon 
dependent not only on state formation and industrial society, but also on 
certain transformations of culture, such as the creation of "high cultures" 
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and their changing relations with popular or folk cultures, and the imbrication 
of all particular cultures within a putatively context-free space of cross-cul- 
tural communication. At the same time, he is clear in arguing that nationalism 
is distinctively modern and that it is not strictly the result of prior ethnicity: 


.. . nationalism is not the awakening and assertion of these mythical, supposedly 
natural and given units It 1s, on the contrary, the crystallization of new units, 
suitable for the conditions now prevailing, though admittedly using as their raw 
material the cultural, historical and other inheritances from the pre-nationalist 
world, (Gellner 1983: 49) 


Gellner (1983: 55) holds that "nationalism .. engenders nations, and not the 
other way round." Similarly, Hroch (1985) argues that nationalism arose from 
activities of cultural elites seeking histories and constituted the identities of 
nations without necessarily giving those identities any immediate political 
purpose; once established, such nationalist claims were available for politici- 
zation by cross-class groups. 

The state-centered approach, in sum, clarifies one dimension of nationalism 
but obscures others. In particular, it (i) makes it hard to understand why 
national identity can stir the passions it does, and (ii) encourages analysts 
either to ignore ethnic and other identities that do not coincide with states or 
to treat them as somehow naturally given. 


Ethnicity and History 


A good deal is left unaddressed by analyses that rely on states or markets 
as material "bases" to explain the cultural "superstructures" of nationalism. 
This is a thinner approach, for example, than stressing "the interaction of 
two orders of concrete experience, that of everyday life and that of relations 
with the state," each crucial to the construction of the contrasting figures 
of citizen and foreigner (Topalov 1991: 176). Similarly, many approaches 
to these issues emphasize the constitution of a social realm (or "civil society") 
separate enough from the state that state-society relationships might become 
the focus of attention and even of disputes over legitimacy (Poggi 1992, 
Cohen & Arato 1992, Seligman 1992, Keane 1983, and Calhoun 1993). 
Accounts that proceed in an exclusively state-centered way are also apt to 
underestimate the many changes in patterns of culture that preceded and 
paved the way for nationalism (prominent themes in the older historiography 
of Kohn 1944, 1962, Hayes 1931, 1966, Meinecke 1970, and Kedourie 
1960, 1974). The Protestant Reformation, for example, was crucial as it 
replaced the universalistic notion of Christendom with local and regional 
variants of the common faith, mobilized popular participation, promulgated 
vernacular discourse and printed texts, and invoked the theological (and in 
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some Calvinist variants political) sovereignty of the people against Church 
and monarchs (Kohn 1944: ch. 4). 

Later depoliticization of religion was, in turn, both an important concom- 
itant of state-building and an autonomously significant trend. In Switzerland, 
for example, longstanding religious divisions were replaced by linguistic ones 
in the wake of mid-nineteenth century revolutionary upheavals and national- 
ism. As late as 1848, Catholic territories made Protestantism unlawful (and 
vice versa). In this older regime, language was a matter of voluntary personal 
choice with little political significance. After mid-century, the pattern was 
reversed. Territories were divided on linguistic lines and religion was a matter 
of personal preference with markedly reduced political consequence (Ander- 
son 1991: 138). At the very least, state-centered and economy-centered 
accounts need cultural complements to deal with variation in the forms of 
nationalism. 

The older modernization theories generally saw nationalism as a functional 
substitute for local communities, religions, and other sources of identity and 
security that were necessarily disrupted by the larger scale, greater individu- 
alism, and more rapid social change of modernity (Geertz 1963, Gellner 1964, 
Hayes 1966). Identification of individuals with the nation (rather than tribe 
or other section) was a functional need to be achieved in the course of 
modernization (Apter 1965). Such treatments owe a great deal to binary 
models of social change like Durkheim’s (1893) account of the transition from 
mechanical to organic solidarity. Haas (1964: 465) puts forward a similar 
argument drawing on Weber and Toennies: 


The nation is a synthetic Gemeinschaft. In the mass setting of modern times, it 
furnishes the vicarious satisfaction of needs that have previously been met by the 
warmth of small, traditional, face-to-face social relations. As social life has been 
transformed by industrialization and social mobilization into something resem- 
bling a Gesellschaft based on interest calculations, the nation and nationalism 
continue to provide the integrative cement that gives the appearance of community. 


Nationalism is of interest to Haas (1986) solely as part of a process of 
rationalization. 

The implicit message of such theories was that attempts to maintain ethnic 
autonomy vis-a-vis the state were reactionary and antimodern; nationalism 
was bad when it was like ethnicity, but good when it was tied to a modernizing 
state. Indeed, for both state elites and modernization theorists, ethnic groups 
are defined in relation to the nation-state as subordinate internal and/or 
cross-cutting identities: Jews, Transylvanians, Tibetans, Ibo. The distinction 
between nation and mere ethnic group is precisely the attribution to the former 
of the right to an autonomous state, or at least an autonomy of some sort 
within the state. On such an account it doesn’t matter whether the nation is 
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an ethnic group that has proved its superiority in historical struggle (material 
or ideological), or a multi-ethnic population. 


Origin Myths 


Nationalisms vary, thus, between claims to have superseded traditional 
identities such as ethnicity by the founding of a true and modern nation, and 
claims to national identity and sovereignty rooted precisely in ancient 
ethnicity. The paradigmatic contrast of these two forms in the literature on 
nationalism is that between France and Germany. In both cases, historical 
narratives are mobilized to underpin the nationalist myths. The French 
narrative traces the nation to a modern act of founding by its members, people 
who were not constituted properly as French (rather than Provencal or 
Bearnaise, Protestant or Catholic) until that radically novel founding. It 
emphasizes the nation-making political form of the republic and the idea of 
citizenship (Best 1988). In Germany, nationalist history-writing pushes further 
back in pursuit of a "naturalizing" account of German ethnicity; Germany 
must be rooted in an "always already existing" ethnic identity. German 
nationalists from Herder and Fichte forward have emphasized ethnic rather 
than "political" or "civic" criteria for mclusion in the nation (see Alter 1989, 
Hayes 1926, 1931, Kedourie 1960, Kohn 1962 on this classic French/German 
contrast in styles of nationalism). When Renan (1990, orig. 1882) described 
the nation as a "daily plebiscite," thus, he was not making a universalizing 
statement or offering a definition. He was distinguishing those nations (such 
as France) that are the result of the free choices of their members from those 
(such as Germany) whose identity and cohesion are given to their members 
independently of any voluntary will. Such differences in nationalist narratives 
have practical consequences. Since voluntary will is so crucial to the narrative 
of French nationalism, for example, France makes it easier than Germany 
does for immigrants to attain citizenship (even though immigration itself, and 
right of legal residency, is no easier, Brubaker 1992, Noiriel 1988, 19912). 

There are many rhetorical attractions for nationalists to claim that their 
nations are simply given and immutable (i.e. ethnic) rather than constructions 
of recent historical action or tendentious contemporary claims. First and 
foremost, this claim "naturalizes" nationhood, and seems to leave third parties 
with the choice between recognizing a "natural" human identity or denying 
it and possibly even condoning its “genocide” (a neologism that reveals the 
specifically modern nature of this problem, Anderson 19915. Where it is 
recognized that a nation has a founding moment, it is still attractive to see 
this as a consequence not merely of choice, but of a long narrative of historical 
development that historically locates the proto-nation in primordial times. 
Much early scholarly writing on nations and nationalism worked within this 
rhetoric and sought to discover which were the "true" ethnic foundations of 
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nationhood (see Meinecke 1970 on Germany, Skurnowicz 1981 on Poland, 
and Zacek 1969 on Czechoslovakia). 

The contrasting rhetoric is tied both to ideas of popular sovereignty and to 
modernist (or Enlightenment) opposition to tradition. The claim to voluntary 
historical foundation (e.g. in US and French nationalist narratives) is a claim 
to the liberation of individuals both from illegitimate domination and from 
unreasoning acceptance of mere tradition. This rhetoric of liberating rationality 
thus assumes (though with opposite evaluation) the same idea of tradition as 
ancient, unquestioned inheritance as does the narrative of naturalizing 
primordiality. This is a problematic understanding of tradition and hence of 
ethnicity. 

As ideology, it is no doubt effective to claim that a nation has existed since 
time immemorial or that its traditions have been passed down intact from 
heroic founders. Sociologically, however, what matters is not the antiquity 
of the contents of tradition, but the efficacy of the process by which tradition 
constitutes certain beliefs and understandings as unquestioned, immediate 
knowledge, as the basis for disputing or questioning other claims (Calhoun 
1983). The focus is not simply on continuity, but on the reproduction of 
culture, the process of passing on that is the literal meaning of tradition (Shils 
1981). What is reproduced is not simply content, but a “habitus” or orientation 
to social action (Bourdieu 1976, 1990). Ethnicity or cultural traditions are 
bases for nationalism when they effectively constitute historical memory, 
when they inculcate it as habitus, or as “prejudice” (in Gadamer’s 1975, 1977 
sense of a precondition to judgment), not when (or because) the historical 
origins they claim are accurate. 

Weber (1922) expressed this common view in defining a traditional 
orientation as respect for that which has always existed, thereby suggesting 
that such an orientation must vanish in the face of modernity with its incessant 
social change. Such a view provides for easy inversion: whenever traditions 
can be shown to be created and/or recent, they must be false. This is the 
implication of Hobsbawm & Ranger’s (1983, Hobsbawm 1990) treatment of 
nationalism, in which they argue that because the “traditions” of nationalism 
are “invented” they are somehow less real and valid. But it is not clear why 
this should be so. Hobsbawm & Ranger seem to accept the notion that 
long-standing, “primordial” tradition would somehow count as legitimate, and 
therefore that illegitimacy follows from their demonstration that various 
nationalist traditions are of recent and perhaps manipulative creation. This 
seems doubly fallacious. 

First, all traditions are “created,” none are truly primordial. This was 
acknowledged, though rather weakly, even by some of the functionalists who 
emphasized the notion of (constructed) primordiality and the “givenness” of 
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cultural identities and traditions (Eisenstadt 1966, 1973, Geertz 1963, Gellner 
1964). Second, all traditions are internally contested and subject to continual 
reshaping, whether explicit or hidden. Potential lineage headmen argue over 
their status in terms of different narratives of descent and ancestral authority 
(Fortes 1945, 1949, Calhoun 1980). Similarly, as Leach (1954) and Barth 
(1969) and his colleagues have argued, ethnic identity is constituted, 
maintained, and invoked in social processes that involve diverse intentions, 
constructions of meaning, and conflicts. Not only are there claims from 
competing possible collective allegiances, there are competing claims as to 
just what any particular ethnic or other identity means. Dispute by no means 
always undermines traditional identities. Ethnicity is a rhetorical frame within 
which certain disputes are conducted; participation in the disputes can actually 
reproduce ethnic understandings (changed or unchanged). There is a differ- 
ence, thus, between disputes that challenge particular constructions of ethnic 
identity (or other aspects of tradition) and those that challenge the meaning- 
fulness of ethnic identity as such. 

In this context, the difference between the claims of nationhood and 
subordinate ethnicity need not be great (Horowitz 1985, Kellas 1991). Thus 
“nationalism” is identified with the state in both India and Africa, while 
"communalism" and "tribalism" are seen as divisive "ethnic" identities. 
General speaking, Nehru (and Indian predecessors back to the early 
nineteenth century) were more successful at invoking, claiming and/or creating 
a common sense of national identity than were most of their African 
counterparts. This was in part because of differences in the integration of 
precolonial “India” and the various colonial African states. But India too was 
in part a colonial creation, and the claim of national unity was developed in 
relation to British colonization (indeed, the length and intensity of British 
colonization may be as important a factor as precolonial history). In writing 
his popular history, The Discovery of India, Nehru (1949) was giving historical 
depth to a nationalist narrative that had as its other crucial base the more 
"modern" struggle against the British. Indian nationalists thus attempted to 
appropriate both the rationalist rhetoric of liberation and the claim of deep 
ethnic history, tradition almost to the point of primordiality. In this attempt, 
they shared much with many anti- and post-colonial nationalisms. So long as 
the British ruled in India, the project of nurturing a sense of ethnic nationhood 
was facilitated by the contrast with the obviously crucial colonial "other." 
The departure of the British from India changed the meaning of Congress 
nationalism, however, as this became the program of an Indien state, not of 
those outside official politics and resisting an alien regime. Among other 
effects of this, a rhetorical space was opened up for "communal" and other 
sectional claims that were less readily brought forward before (Freitag 1989, 
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Chatterjee 1986 and forthcoming). The opposition between primordiality and 
"mere invention," thus, leaves open a very wide range of historicities within 
which national and other traditions can exert real force. 


Language and History 


The translation of ethnicity into nationalism is partly a matter of converting 
the cultural traditions of everyday life into more specific historical claims. As 
Gellner (1983) suggests, this transformation is made possible partly by the 
development of a literate "high culture" and is an extension of its relationship 
to the everyday culture of face-to-face relationships. Anderson (1991) 
develops this point with more systematic attention to the role of the "print 
capitalism" of newspapers and novels, which not only engage in history 
making but constitute the nation as a community of like readers in the 
imagination of each. This is true not just of the contents of tradition, as folklore 
gives way simultaneously to "scientific history" and national myth, but of the 
very medium. Not only literacy but space-transcending communications 
technologies from print through broadcast can play a crucial role both in 
linking dispersed populations and in creating the possibility for producing a 
popular memory beyond the scope of immediate personal experience and oral 
traditions (Deutsch 1953, 1969, Calhoun 1992). Nowhere, however, is the 
issue clearer than in the historicizing approach to language of the early modern 
era. This reconstituted an aspect of the everyday cultural means of social life 
as part of a historical/ethnic claim to nationhood. 

Particularly in Germany, language was given a central status from Herder 
and Fichte on. In stressing the "originality" of the German language and the 
"truly primal" nature of the German character, Fichte, for example, claimed 
a supra-historical status for German nationality (Fichte 1968, orig. 1806- 
1807, Meinecke 1970: 92). Historically formed national characters were 
inferior, he argued, to the true metaphysical national spirits that were based 
on something more primal than common historical experience. This does not 
mean that Fichte and others of similar orientation saw glory only in the past. 
On the contrary, they envisaged a dramatic break with many aspects of the 
past and a national self-realization in what Fichte called a new history. The 
old history was not one properly self-made, not the product of the self-con- 
scious action of the nation as historical actor. Here echoes of the French 
revolution appear in German nationalist historiography. The rhetoric of 
nationalism came characteristically to involve the metaphor of awakening. 
This involved political, not just ethnic, claims. Positioning their nation within 
history allowed nationalists who claimed ancient roots still to evoke the 
heroism of creation and the prestige that since the Enlightenment adhered in 
many quarters to the production of something new—as in the United States’ 
claim to be "the first new nation" (Lipset 1960). 
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Nationalism has a complex relationship to history. On the one hand, the 
production of historical accounts of the nation can figure very prominently 
(and this is hardly distinctive of Germany or the West; see examples in Nehru 
1949, Gandhi 1939, 1967, and discussion in Chatterjee 1986 and forthcom- 
ing). Indeed, the modern discipline of history is very deeply shaped by the 
tradition of producing national histories designed to give readers and students 
a sense of their collective identity. At the same time, however, nationalists 
are prone, at the very least, to the production of Whig histories, favorable 
accounts of "how we came to be who we are.” As Ernst Renan (1990: a 
wrote famously in 1882: 


Forgetting, | would even go so far as to say historical error, is a crucial factor in 
the creation of a nation, which is why progress in historical studies often 
constitutes a danger for {the principle of] nationality. Indeed, historical enquiry 
brings to light deeds of violence which took place at the origin of all political 
formations, even those whose consequences have been altogether beneficial. Unity 
is always effected by means of brutality 


Not only is the definition of nation subject to contest and struggle, the fruits 
and even the violence of these contests and struggles become inescapably part 
of who we are. For all its civic rationalism, France has hardly been free from 
appeals to ethnic nationalism. An ethnic conception of la patrie stood behind 
much of the attack on Dreyfus; Maurras sought to define a true French nation 
free of Jews, Protestants, Freemasons and other foreigners (Sutton 1982). 
Aspects of this heritage remain important in contemporary debates over 
immigration (Todorov 1990, Noiriel 1988). Indeed, Greenfeld (1991, 1992) 
goes so far as to group French nationalism with those to the East as 
“collectivistic-authoritarian” and based on ressentiment (by contrast to the 
"individualistic-libertarian" English variant). France's violent and irrational 
Anglophobia (Greenfield 1992:183) is part of her evidence. 

Nineteenth- and twentieth-century nationalisms have been particularly 
obsessed with history, as with ethnicity, perhaps because most involve claims 
to nationhood which are in important ways problematic or challenged by 
existing states. Thus Gandhi's Hindu nationalist opponent, Savarkar 
(1937:284) felt compelled to argue that “venly the Hindus as a people differ 
most [sic] markedly from any other people in the world than tbey differ 
amongst themselves. All tests whatsoever of a common country, race, 
religion, and language that go to entitle a people to form a nation, entitle the 
Hindus with greater emphasis to that claim." Many Indian nationalist 
historians took on a dual challenge in wnting their histories (Chatterjee 
forthcoming). First, they sought to show that India was one country, against 
the British suggestion that without the alien Raj, disunity and conflict would 
reign amongst its many contending peoples (or "communities"). Second, they 
sought to show that this one country was essentially Hindu, not Muslim (and 
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thus among other things constituted “indigenously” rather than by previous 
imperial invasions). Indian intellectuals from the nineteenth century on were 
often as cosmopolitan as their European counterparts (and certainly at least 
as likely to be multilingual). But this could never appear as unproblematic in 
the context of colonial rule as it had for the European enlighteners. Many 
Indian nationalists (including Nehru) wrote in English and spoke it more 
comfortably than any “Indian” language; they helped, indeed, to make English 
an Indian language. But this involved a tension between English as the 
language of the colonizer and as the putative lingua franca that was to help 
constitute one nation by cutting across the linguistic divisions of the 
subcontinent. Moreover, at the same time that some nationalists appropriated 
English as an Indian language, others produced a renaissance of modern Indian 
languages like Bengali or Marathi. As in Catalonia, Hungary, China, and 
elsewhere, nationalism meant producing a new, modern literature in the 
vernacular language. One dimension of this was the attempt to forge a unity 
between the language of literature and intellectuals and that of ordinary 
people—since groups previously separated by language were now to be united 
by a national language. 

For the German Romantics, language was a key test of the existence of a 
nation (Kedourie 1960: 62—73). Language, moreover, was understood primar- 
ily in terms of continuity, since "few things seem as historically deep-rooted 
as languages, for which no dated origins can ever be given" (Anderson 1991: 
196). Language often plays a key role in ethnic (or "naturalizing") versions 
of nationalism, since an ancient language, shared as the parental tongue among 
the members of the nation, seems a guarantee of its true existence prior to 
and separate from any particular set of political arrangements (including 
fragmentation or alien rule). But the language of nationalist movements is 
often not the parental tongue of the putative nation's members, not the first 
language of each, but rather the second language that unites them. It may be 
an elite language, shared among aristocrats and/or a bourgeoisie; it may be 
the language of a colonial power. The shared language is not the “test” of 
nationhood, but the means of imagining—and thereby creating—the nation 
(Anderson 1991). 

Language figures in at least three different ways in accounts of nationalism. 
First, it is a central part of the claim that nationhood is rooted in ethnicity. 
This leads to attempts to show the historic depth and distinctiveness of 
languages. Second, shared language is a condition (or at least a facilitator) 
of claimed national community regardless of whether it is ancient or 
distinctive. As Anderson stresses, the pioneering nationalisms of the Americas 
were launched in the colonial languages of Spanish and English. Third, 
opposition to linguistic variation is a key way in which nationalists in power 
attempt to make the nation fit the state. Thus most citizens of France did not 
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speak French until the late nineteenth century, and only after the imposition 
of often-resented educational uniformity (Weber 1976). Russification pro- 
grams begun under the Czars were carried forward by communist rulers after 
only brief revolutionary interruption. In the last of these three we see clearly 
the impact of state-building, and the strong case for a state-centered theory 
like Tilly's (discussed above). But such a theory offers little help in making 
sense of the first two, or even in explaining why language should be an issue. 
Part of the answer to this question has to do with the relationship between 
claims to pre-existing ethnicity and claims to founding historical moments 
and political forms. Part of it also has to do, however, with the issue of how 
people imagine the nation, how this particular category of identity had come 
to figure so prominently in the modern world. 


Ethnic Continuities 


Generally speaking, the most prominent twentieth-century analysts of nation- 
alism have rejected the claim that nationalism can be explained by pre-existing 
ethnicity. Kohn (1944) and Seton-Watson (1977) have stressed the crucial 
role of modern politics, especially the idea of sovereignty. Hayes (1926, 1960) 
has argued for seeing nationalism as a sort of religion. Kedourie (1960) has 
debunked nationalism by showing the untenability of the German Romantic 
claims. More recently, Gellner (1983) has placed emphasis on the number of 
cases of failed or absent nationalisms: ethnic groups which mounted either 
little or no attempt to become nations in the modern senses. This suggests 
that even if ethnicity plays a role it cannot be a sufficient explanation (though 
one imagines the nineteenth-century German Romantics would simply reply 
that there are strong, historic nations and weak ones destined to fade from 
the historic stage). Hobsbawm (1990, Hobsbawm & Ranger, 1933) has largely 
treated nationalism as a kind of second-order political movement based on a 
false consciousness which ethnicity helps to produce but cannot explain 
because the deeper roots lie in political economy not culture. 

Against this backdrop, Anthony Smith (1983, 1986, 1991) has tried to show 
that nationalism has stronger roots in premodern ethnicity then others have 
accepted. He acknowledges that nations cannot be seen as primordial or 
natural, but nonetheless argues that they are rooted in relatively ancient 
histories and in perduring ethnic consciousnesses. Smith agrees that nation- 
alism, as ideology and movement, dates only from the later eighteenth century, 
but argues that the "ethnic origins of nations" are much older. Smith focuses 
on ethnie—ethnic communities with their myths and symbols—and shows 
that these exist in both modern and premodern times, and with substantial 
continuity through history. Because, Smith argues, “myths, symbols, mem- 
ories and values are 'carried' in and by forms and genres of artifacts and 
activities which change only very slowly, so ethnie, once formed, tend to be 
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exceptionably durable under ‘normal’ vicissitudes, and to persist over many 
generations, even centuries, forming ‘moulds’ within which all kinds of social 
and cultural processes can unfold and upon which all kinds of circumstances 
and pressures can exert an impact” (1986: 16). This is the foundation both of 
particular nations and of the idea of nation. 

Smith argues that the origins of modern nationalism lie in the successful 
bureaucratization of aristocratic ethnie, which were able to transform them- 
selves into genuine nations only in the West (Smith 1986: 109). In the West, 
territorial centralization and consolidation went hand in hand with a growing 
cultural standardization. “The indivisibility of the state entailed the cultural 
uniformity and homogeneity of its citizens” (1986: 134). “It would indeed 
not exaggerate the matter to say that what distinguished nations from ethnie 
are in some sense, ‘Western’ features and qualities. Territoriality, citizenship 
rights, legal code and even political culture, are features of society that the 
West has made its own. So is the realization of social mobility in a unified 
division of labour’ (1986: 144). Well beyond the West, however, the 
compulsion for ethnie to enter the political arena is seemingly universal to 
the modern era. “In order to survive, ethnie must take on some of the attributes 
of nationhood, and adopt a civic model” (1986: 157). Cross-class inclusion 
and mobilization for common political purposes are essential (1986: 166). 
Conversely, rooted in ethnicity, nations are long-term processes, continually 
reenacted and reconstructed; they require ethnic cores, homelands, heroes and 
golden ages if they are to survive. Small, breakaway nations rooted in 
particularist, quasi-religious visions are the most common new nationalist 
projects today (1986: 212-13). Nonetheless, this tendency towards the 
production of many new small nations is contained, Smith suggests (writing 
before the events of 1989-1992 in Eastern Europe, the Soviet Union, and 
Africa), by the existing framework of nation states (1986: 218, 221). In sum, 
“modern nations and nationalism have only extended and deepened the 
meanings and scope of older ethnic concepts and structures. Nationalism has 
certainly universalized these structures and ideals, but modern ‘civic’ nations 
have not in practice really transcended ethnicity or ethnic sentiments” (1986: 
216). 

Smith does not claim that ethnicity is natural, rather than socially 
constructed. His argument, rather, is that ethnicity is very slow to change. 
He acknowledges also that premodern ethnic boundaries were not sharply 
fixed, though he does claim that they maintained a level of integrity. Above 
all, Smith suggests that it is possible to trace a “genealogy of nations” in 
which both cultural and social structural variables can be introduced to account 
for which ethnies- become nations. The crucial moment in such genealogies, 
he suggests, is the transformation of the members of an ethnie into citizens. 
This is a cultural transformation of the character of membership, stressing the 
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lateral ties that link members despite class divisions. and that form the basis 
for potential political mobilization (Smith 1986: 166). 

Smith stresses the continuity in ethnic groupings and the relations of cultural 
similarity that define them. In a clear contrast, Brass (1991: 8) offers an 
account of ethnicity as the product of manipulation, or at least recurrent 
invocation. Ethnic groups “are creations of elites, who draw upon, distort, 
and sometimes fabricate materials from the cultures of the groups they wish 
to represent in order to protect their well-being or existence or te gain political 
and economic advantage for their groups as well as for themselves.” 


Imagined Communities and Categorical Identities 


It is tempting to explain national identity as a transformation of ethnic or 
cultural similarity wrought by state-building. Certainly a crucial difference 
between ethnicities and nations is that the latter are envisioned as intrinsically 
political communities, as sources of sovereignty, while this is not central to 
the definition of ethnicities. There are, however, a number of obstacles to 
seeing this as the whole of the issue. First, nationalisms do not vary neatly 
with the success of efforts to create consolidated states. As Gellner points 
out, there are vastly more languages and ethnic or cultural groups than there 
are nationalist movements or states. This is not just because some lost out in 
a struggle for national identity or autonomy. 


For every effective nationalism, there are n potential ones, groups defined either 
by shared culture inberited from the agrarian world or by some other link .. which 
could give hope of establishing a homogenous industrial community, but which 
nevertheless do not bother to struggle, which fail to activate their potential 
nationalism, which do not even try. (Gellner 1983: 45) 


Beyond this, nationalism is not simply a claim of ethnic similarity, but a 
claim that certain similarities should count as the definition of political 
community. For this reason, nationalism needs boundaries in a way premodern 
ethnicity does not. Nationalism demands internal homogeneity throughout a 
putative nation, rather than gradual continua of cultural variation or pockets 
of subcultural distinction. Perhaps most distinctively, nationalists commonly 
claim that national identities “trump” other personal or group identities (such 
as gender, family, or ethnicity) and link individuals directly to the nation as 
a whole. This is sharply contrary to the way in which most ethnic identities 
flow from family membership, kinship, and membership in intermediate 
groups. 

Nationalism, in short, involves a distinctive new form of group identity or 
membership. It is a new rhetoric of belonging to large scale collectivities. 
This depends on new forms of collective imagination, and also on communi- 
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cations capacities and social organizational conditions that encourage a sense 
of identity with large populations of distant and largely anonymous others. It 
also depends crucially on modern ideas of individual equivalence. 


Individualism 


In nearly all premodern patterns of social organization, people were members 
of polities and other social groups primarily by virtue of their occupation of 
a variety of ascribed statuses based on descent, kinship, age, gender and the 
like. Their membership in larger groups, like clan, was based on and grew 
directly out of smaller groups like lineage segments, and out of specific 
interpersonal relationships like father-son (Fortes 1945, 1949, Calhoun 1980). 
This was true, despite otherwise dramatic differences, for relatively small- 
scale African societies and for such extremely large-scale polities as imperial 
China. The modern notion of self as individual changed this. Personal identity 
came to be seen increasingly as the attributes of a self-contained individual— 
what Taylor (1989) has charactenzed in Locke's writings as the "punctual 
self.” “The alternative to playing the role of so-and-so's son, so-and so's 
brother, so-and-so's wife was," as Schwarcz (1986: 112) has written of 
protagonists in China's largely nationalist New Culture movement, “to gain 
a positive sense of one's own individuality." Such thinking made it common 
to understand social groupings as sets of equivalent persons (as in the idea of 
class as well as in liberal individualism) rather than webs of relationships 
among persons or hierarchies of positions (Dumont 1982). The modern idea 
of nation, despite its roots in notions of descent, has been nearly always such 
a category of equivalent persons. 

So, despite more relational roots, is the prevailing modern usage of 
"ethnicity." This is revealed in the way in which censuses have been 
constructed and conducted, quantifying the members of ethnic, racial, and 
national categories (Anderson 1991: 168). It is revealed also in the ways 
in which Western social scientists have sometimes hypostatized notions like 
caste and lineage segment or corporation (see Kapferer 1988: ch. 7, 
Chatterjee forthcoming). Terms with at least in part a relational usage are 
recast as though they were simply collectivities of equivalent individuals. 
Similarly, Ekeh (1990) has noted a tendency to abandon the use of tribe 
in social anthropology and African studies, and to replace it with "ethnic 
group." But this has the effect of imposing a categorical notion—a collection 
of 1ndividuals marked by common ethnicity—in place of a relational one. 
Where the notion of tribe pointed to the centrality of kin relations (all the 
more central, Ekeh suggests, because of weak African states from whose 
point of view "tribalism" is criticized), the notion of ethnic group implies 
that detailed, serious analysis of kinship is more or less irrelevant. In part, 
this is a response to recognition of the contested nature of ethnic identities; 
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it involves an attempt to move away from substantive claims to identify 
ethnicity on the basis of the “real” shared descent of the members of a 
group. Weber (1922: 389) defined an ethnic group as one whose members 
“entertain a subjective belief in their common descent because of similarities 
of physical type or of customs or both, or because of memories of 
colonization and migration.” Barth (1969) took this logic a step further and 
abandoned even the notion of subjective belief in common descent, replacing 
it with simply the existence of recognized group boundaries. Here the 
triumph of a categorical logic is complete: an ethnic group is simply a 
bounded set of individuals, not necessarily characterized by any internal 
pattern of relationships, much less one of kinship or descent. Once ethnic 
groups are treated in this purely categorical way—-as they are in much 
everyday contemporary discussion as well as in academic studies—similar- 
ities rather than relationships form the defining connection among members. 
This opens the door to new pressures for conformity. 

The categorical nature of national identities is linked strongly to ideas of 
purity and normalizations of the “correct” way to be a member of a nation. 
Nationalisms linked to state power are often repressive, thus, not only of the 
members of "alien" nations or ethnic minorities (like Jews in Europe) but of 
their own members. Thus European nationalisms have commonly been 
strongly colored by ideas of middle-class respectability, particularly in the 
realm of sexuality (Mosse 1985). National identity has been an eroticized 
identity, and one that carried prohibitions of deviant sexualities as sharply as 
on deviant ethnicities (Parker et al 1992). Nationalism has also been a 
distinctly gender-biased ideology in many settings (Eley 1992). Valuing the 
family as the source of the nation’s continuity in time, nationalist ideologues 
have seen men as future martyrs, women as mothers. Beyond this, however, 
nationalists resist women’s movements because accepting the domination of 
male interests and perceptions merely perpetuates a taken-for-granted, mono- 
lithic view of the nation, while encouraging women to identify their distinctive 
interests and views opens claims that gender has autonomous status as a basis 
for personal identity which does not pale into insignificance before the 
commonalties of (male-dominated) nationhood. 

Individualism exerts another influence on ideas of nation. Nations are 
generally seen as logical equivalents, and themselves as individuals. Just as 
liberal political theory suggests that employer and employee, rich man and 
poor woman, are equivalent political persons, so liberal international theory 
suggests that nations like San Marino and Singapore are formally equivalent 
to China and Germany.? As individuals, nations may also be understood as 


*The contemporary discourse of multiculturalism suggests that ethnic groups are to be seen 
in much the same relationship of formal equivalence (Taylor 1992) 
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unitary subjects in historical time. In Fichte's words (quoted in Meinecke 
1970: 89), "nations are individualities with particular talents and the possi- 
bilities of exploiting those talents." Nations not only could take action but 
could experience abuse; especially after 1848, Poland was conceived as "the 
martyr-nation" (Kohn 1944, Walicki 1982, Skurnowicz 1981, Meinecke 
1970); Russian nationalism was colored and driven by a constant “ressenti- 
ment" (Greenfield 1990, 1992). Marx's contemporary, Friedrich List, “pro- 
nounced nations to be ‘eternal,’ to constitute a unity both in space and time...” 
(Szporluk 1988: 115). This did not preclude the idea that nations were capable 
in some sense of making themselves, forging a higher individuality out of 
heterogeneous constituent parts. 

Anderson (1991: 26) has seized on just this aspect of individuality as central 
to the modern understanding of nation: 


The idea of a sociological organism moving calendrically through homogenous, 
empty time is a precise analogue of the idea of the nation, which also is conceived 
as a solid community moving steadily down (or up) history. 


As Anderson has stressed, the category of nation thus unites the living and 
the dead. This is a crucial explanation for why the nation can demand such 
extraordinary sacrifices and commitments from its members. It joins the 
biographies of individual persons and of the nation as a whole in a common 
historical narrative. Not only does the nation locate individuals temporally in 
relation to past and future generations, and in the global context as members 
of one among many nations, the nation also locates each individual's 
biography and quotidian narrative as one among the many comparable 
biographies of the members of the specific nation (see also Bloom 1990). 


Imagined Communities and Indirect Relationships 


As a category of equivalent persons, a nation is, in Anderson’s (1991) 
evocative phrase, an "imagined community." Rather than treating nationalism 
as a genre of ideology comparable to liberalism or fascism, Anderson (1991: 
5) suggests that we regard nationalism as a distinct mode of understanding 
and constituting the phenomenon of belonging together, comparable to kinship 
or religion. A nation, thus: 


is an imagined political community ... It is imagined because the members of 
even the smallest nation will never know most of their fellow-members, meet 
them, or even hear of them, yet in the minds of each hves the image of their 
communion... The nation is imagined as limited because even the largest of them, 
encompassing perhaps a billion living human beings, has finite, if elastic, 
boundaries, beyond which lie other nations... It is imagined as sovereign because 
the concept was bom in an age in which Enlightenment and Revolution were 
destroying the legitimacy of the divinely-ordained, hierarchical dynastic realm... 
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Finally, it is imagined as a community, because, regardless of the actual inequality 
and exploitation that may prevail in each, the nation 1s always conceived as a 
deep, horizontal comradeship. (1991: 6-7) 


Much of Anderson’s book (perhaps the most original, if not the most 
systematic contribution to the large recent literature on nationalism) is devoted 
to trying to account for the nse of this distinctive way of imagining community. 
In addition to the account of distinctive temporal location already touched on, 
Anderson offers three main arguments. 

First, Anderson takes up the notion of language as the essential cultural 
condition of nationhood. He notes that nationalism did not arise simply out 
of long-standing traditions of linguistic commonality. On the contrary, in 
many settings nationalism involved the privileging of vernaculars in place of 
Latin and other previously widely used languages of high culture and 
administration (and of cross-regional sharing). It sometimes involved the 
recovery of little used languages. It often depended on the integration of more 
or less distinctive dialects or members of language families into new common 
languages. And in many cases nationalist imaginings took place in the 
language of colonial powers. What gave language its efficacy in relation to 
nationalism was the coincidence of print technology and capitalism. 

By pushing for ever-larger markets, capitalist cultural production (in the 
form of books and newspapers) called forth “unified fields of exchange and 
communication below Latin and above the spoken vernaculars” (1991: 44). 
That is, in early modern Europe, capitalists sought markets larger than the 
small number of elite readers of Latin, and larger than the number of speakers 
of nearly all local vernaculars. They thus pioneered the creation of the specific 
linguistic communities associated with eventual national identities. In addi- 
tion, print-capitalism gave a new fixity to language, encouraging a stable 
orthography, grammar, and form in general. This encouraged the image of 
antiquity conducive to the notion of long-standing national identity by 
obscuring the extent to which languages gradually evolved and successive 
members of putative nations spoke mutually unintelligible tongues. Not least 
of all, print-capitalism standardized usage of certain administratively sanc- 
tioned languages, thus disadvantaging within each realm the speakers of other 
languages. Where Latin had previously united dozens of local dialects and 
languages within Hapsburg domains, for example, increasing reliance on 
German disadvantaged Hungarian elites (among others) and created a pressure 
for nationalism within and eventually against the empire. At the same time, 
this gave incentives (not always equally taken up) for elites to make common 
cause (and common culture) with non-elites. Where premodern society had 
been divided especially into vertical layers, modern politics (including the 
politics of language) encouraged the overcoming of vertical divisions and the 
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substitution of horizontal borders. As Gellner (1983: 18) described this 
transition, “a high culture pervades the whole of society, defines it, and needs 
to be sustained by the polity. That is the secret of nationalism.” 

In addition to helping to sort out the role of language in the creation of 
ethnic foundations for nationalism, Anderson’s formulation of “print-capital- 
ism" also helps to make sense of the new kind of imagining of community 
more directly. The readers of the same novels and newspapers were joined 
in imagining communities of other such readers, and of imagining commu- 
nality with the protagonists of the stories they read (1991: 24-36). The readers 
of daily newspapers not only learned the same news as each other, they learned 
“wholly new ideas of simultaneity” (1991: 37). They learned to situate 
themselves in terms of the activity of many individuals (and nations) taking 
place in the same temporal moment, not solely in a linear development. This 
also allowed for a sense of shared paths, which “could arise historically only 
when substantial groups of people were in a position to think of themselves 
as living lives parallel to those of other substantial groups of people—if never 
meeting, yet certainly proceeding along the same trajectory” (1991: 188). In 
place of direct relationships among people meeting face to face, thus, print 
technologies (proliferated by capitalist production relations) encouraged the 
creation of a new kind of indirect relationship, a social link existing only by 
virtue of the new medium of communication and its supporting social 
organizations. Much the same was true of markets, which joined distant and 
anonymous populations in indirect and sometimes invisible, but clearly 
powerful social relations (Calhoun 1991, 1992). 

The importance of communications media to national integration has 
certainly been noted before (notably by Deutsch 1953, 1969). In most earlier 
treatments, however, the categorical identity of the nation is presumed and 
research is focused on how the development of communications capacity 
enhances the soctal and political integration of the nation. Anderson’s central 
contribution is to explain how communicative forms figured in creating the 
categorical identity or imagined community of the nation itself. In addition 
to media, Anderson creatively analyzes the career trajectories of creole 
officers of colonial states. He locates an important and early source of 
nationalism in their movement around colonies, and the limits placed on 
both their upward and lateral movement out of the colony in which they 
served. 

These bounded imaginings were given graphic and synoptic expression in 
the proliferation of maps. Early maps had been either cosmographies, locating 
a dynastic or religious realm in relation to heaven and the netherworld, or 
travelers guides, working by landmarks from one location to another. In the 
nineteenth century, maps not only began to proliferate by virtue of mechanical 
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reproduction, they began to register the whole world as a set of bounded 
territories, different colors for different empires or autonomous countries 
(Anderson 1991: 170-78). They became the visual representation of a world 
organized into a system of states. They also offered maps of individual 
countries as “logos,” the image of their territorial shape giving a definite form 
to the imagined community. In something of the same way, museums, like 
history-writing, gave temporal depth to nationalism (Anderson 1991: 178-85, 
Maier 1987). Colonial powers deployed archaeology to unearth the tangible 
(and preferably monumental) remnants of ancient cities and sacred sites; these 
in turn were transformed into tourist attractions and objects of photographs, 
recordings of a tradition constituted in its distinctness from the modern state. 
In cosmopolitan museums artifacts from far-flung contexts were (and are) 
displayed within classifications ordering the world into nations. In national 
museums, artifacts from disparate temporal and spatial settings are arranged 
into national narratives. The crucial link was the production of replicable 
series of artifacts available for classification into types or periods (as distinct 
from temples still seen as singular in their sacredness, or modern “auratic” 
works of art imbued with the singularity of an individual creator). The idea 
of nation is itself an instance and an archetype of this classifying logic of 
categorical identities. 


Conclusion 


The relationship between nationalism and ethnicity is complex. Neither is 
vanishing as part of an obsolete traditional order. Both are part of a modern 
set of categorical 1dentities invoked by elites and other participants in political 
and social struggles. These categorical identities also shape everyday life, 
offering both tools for grasping preexisting homogeneity and difference and 
constructing specific versions of such identities. While it is impossible to 
dissociate nationalism entirely from ethnicity, it is equally impossible to 
explain it simply as a continuation of ethnicity. Numerous dimensions of 
modern social and cultural change, notably state-building, individualism, 
and the integration of large-scale webs of indirect relationships all serve to 
make both nationalism and ethnicity salient. Nationalism, in particular, 
remains the preeminent rhetoric for attempts to demarcate political commu- 
nities, claim rights of self-determination and legitimate rule by reference to 
"the people" of a country. Ethnic solidarities and identities are claimed most 
often where groups do not seek "national" autonomy but rather a recognition 
internal to or cross-cutting national or state boundaries. The possibility of 
a closer link to nationalism is seldom altogether absent from such ethnic 
claims, however, and the two sorts of categorical identities are often invoked 
in similar ways. 
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Abstract 


This chapter reviews research on the segregation of women and men in the 
workplace. After examining ways to measure segregation, I summarize trends 
in sex segregation in the United States and cross-nationally. Occupational 
segregation has declined since 1970, but most workers remain in sex 
segregated jobs. I then evaluate the empirical support for explanations for 
segregation. Demand-side explanations include employers’ preferences, the 
demand for workers, economic pressures, discrimination, and personnel 
practices. Supply-side explanations include the size of the labor supply, the 
neoclassical human-capital explanation, gender-role socialization, workers’ 
values, and the opportunity structure. I conclude that a variety of social and 
economic forces operate both to perpetuate and to reduce segregation. 
However, workplace segregation is an important mechanism in sex stratifica- 
tion, and a stratification perspective stresses the importance of demand-side 
factors. I call for research on sex segregation that examines the behavior of 
all labor market actors. 


INTRODUCTION 


Until the 1970s, the term “segregation” signified separation of the races. The 
phrase “race segregation” still denotes the races’ physical segregation into 
different schools or neighborhoods. However, segregation is more than 
physical separation. It is a fundamental process in social inequality. The 
characteristics on which groups are sorted symbolize dominant or subordinate 
status and become the basis for differential treatment. Indeed, segregation 
facilitates unequal treatment by subjecting groups to different reward systems. 
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By ensuring groups’ mutual ignorance of their respective contributions, it 
preserves an illusion of equal—although separate—treatment. And, of course, 
it is the dominant group that instigates segregation, sets its limits, and 
sometimes permits exceptions (Myrdal 1944, 1962:575). 

Gross (1968) introduced the concept of “sex segregation” into the socio- 
logical lexicon to describe women’s and men’s concentration in different 
occupations. His use was innovative both in treating the sexes’ concentration 
in different occupations as sociologically important and in extending the 
concept to encompass functional differentiation.’ Gross’s article and the 
contemporaneous work of Oppenheimer (1968, 1970) initiated a thriving 
research specialty. This essay reviews the recent—mostly post-19807— re- 
search on the factors associated with tbe persistence and decline of sex 
segregation in the workplace. 

This essay also seeks to emphasize segregation's role in gender stratifica- 
tion. Given this goal, it 1s useful at the outset to summarize the range of 
inequities that follow from sex segregation in the workplace. First and 
foremost, the more female an occupation, the lower its pay (Treiman & 
Hartmann 1981, Marini 1989). Female-typical jobs also are less likely to 
provide benefits (Perman & Stevens 1989), on-the-job training (Duncan & 
Hoffman 1979, Farkas et al 1991), promotion opportunities (Rosenbaum 
1985:130, Glass 1990, Steinberg et al 1990), and the opportunity to exercise 
authority (Reskin & Ross 1992). In general, society and employers appear to 
devalue women's work, at least in part because women do it (Reskin 1988). 


MEASURING SEGREGATION 


Complete segregation excludes members of some groups from certain 
outcomes and confines them to others, while perfect integration distributes 
group members proportionally across outcomes. Although these extremes are 
rare, they serve as standards against which we can measure the extent of 
segregation. To do this, most researchers use the index of dissimilarity (or 
"index of segregation"), which compares two groups' distribution across a set 
of categories.’ The value of the index is the proportion of either group that 
would have to move to a category which the other sex dominates for the 
groups' proportional distributions to be identical. ' 

The amount of sex segregation one observes and our ability to compare 


‘However, Myrdal (1944, 1962:106-7, 313) wrote of “vocational segregation" and “job 
segregation” by race. 

*For pre-1980 studies, see Szafran (1982) and Reskin & Hartmann (1986). 

?Duncan & Duncan (1955) defined the index of dissimilarity as D = x, — y,/2, where x; = the 
proportion of group X in occupation i and y, = the proportion of group Y in occupation i. 


SEX SEGREGATION 243 


levels of segregation are affected by three factors: the fineness of occupational 
distinctions, the sizes of occupations, and the sex composition of the labor 
force. Most important, the amount of segregation we see depends on the level 
of aggregation of the units being examined. An example should make this 
clear. In 1991 women were 45.4% of all employees and 45.8% of the “broad” 
occupational category of professional workers, With only that information, 
one might conclude that professional work is almost perfectly integrated. But 
compare the sex composition of two “detailed” occupations within the broad 
category, professional workers: women were 37.7% of the college and 
university teachers, while they accounted for 73.7% of other teachers (US 
Bureau of Labor Statistics 1992:Table 22).* Detailed occupations are in turn 
aggregations of jobs—which are the specific positions that workers hold in 
specific establishments. For example, the occupation “college and university 
teacher” includes someone teaching night classes on repairing office machines 
at Parkland Community College as well as a distinguished professor of 
mathematics at Harvard. The more aggregated the categories across which 
segregation is measured, the less segregation we will capture. Segregation 
indexes computed across broad occupational categories miss substantial 
within-category segregation. Even estimates based on detailed occupational 
categories devised by the Census understate the extent of job-level segrega- 
tion.” For example, the level of sex segregation across jobs in the 1970s was 
about 30 points higher than the index of occupational sex segregation (Bielby 
& Baron 1984). 

The addition of new occupational titles that represent finer distinctions 
within occupations will increase the value of the index, net of any change in 
the amount of segregation.* The sensitivity of the segregation index to level 
of aggregation precludes comparing indexes for occupations aggregated to 
different degrees. Given the Census Bureau's periodic expansions and 
reclassifications of occupations, assessing trends in segregation requires 
caution. 

Changes in the relative sizes of categories imperil assessing trends in 


*US Census “detailed” occupations, of which there were 512 in 1990, are coded at the three-digit 
level. 

SOf course, the optimum level of detail depends on one’s substantive and theoretical interests. To 
the extent that segregated jobs lead to unequal pay or other rewards, it is preferable to measure 
segregation at the job level. Job-level measures would also be needed to observe the effects of 
modified personnel practices by particular employers. 

ÜThis mcomparability has no acceptable solution. Aggregating categories to achieve 
comparability will underestimate the extent of segregation. Computations based on occupations 
present at every time point yield results that are inaccurate for those years for which any occupations 
were excluded (King 1991) Only if new titles reflect new occupations, can one legitimately compare 
indexes based on different numbers of categories (King 1992) 
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segregation.’ A standardized index which holds category size constant allows 
researchers to decompose the change in an index to the redistribution of groups 
across categories (in other words, integration) and to differential changes in 
the sizes of categories (Blau & Hendricks 1979, Williams 1979). 

The relative shares that two groups comprise of a population also affect the 
index of segregation: the less equal their proportions, the lower the value of 
the index (Newman & Mittman 1992). As a result, differences in two groups’ 
shares (either across areas or over time) will contribute to differences in index 
values, net of differences in the extent of segregation. 

The segregation index is insensitive to whether one group systematically 
commands the more desirable outcomes. A society in which one fifth of the 
doctors and engineers are women and one fifth of the kindergarten teachers 
and daycare workers are men (assuming only these four equal-sized occupa- 
tions) would have the same index as one in which one fifth of the lawyers 
and daycare workers are women and one fifth of the engineers and kinder- 
garten teachers are men.? Segregation indexes tell how much groups are 
separated; they do not speak to the question of who has a monopoly on the 
more or less desirable outcomes. 


Sex-Typed Occupations 


The index of segregation conceptualizes segregation as a property of two 
groups' distributions across a set of outcomes. However, some questions lead 
researchers to conceptualize as segregated (or sex typed) any outcome in 
which members of one group are concentrated or excluded—i.e., heavily male 
or female occupations. Cutting points for designating occupations as sex typed 
are arbitrary: 7596 or 8096 one sex, a one-sex majority, or a percentage-point 
deviation from the sexes' representation in the labor force." 

Theory sometimes dictates other measures of segregation. "° For example, 


The segregation index was developed for similar-size units (e.g. schools, census districts), while 
occupations vary enormously 1n size (Newman 1992). 

®For additional lumutations of the index of segregation, see James & Taeuber (1985), Jacobs 
(19892), Baron et al (1991), and Newman & Mittman (1992). 

? An exception 1s Bielby & Baron (19862) who, observing the discontinuity in the distribution of 
sex composition across jobs, modelled sex composition more successfully with a dichotomous 
dependent variable than with the continuous variable, percentage female. Labelling occupations as 
sex typed permits estimates of the proportion of workers employed in “segregated” occupations, the 
characteristics of such workers or occupations, and the extent of mobility between sex-typical and 
sex-atypical occupations. However, the variable definitions of segregated occupations hamper 
comparisons of the extent to which the sexes are concentrated 1n sex-atypical occupations. (The 
continually changing sex composition of the labor force exacerbates these limitations.) 

10or example, Charles (1992) used a “ratio index of segregation" to compare countries’ levels of 
sex segregation, net of cross-national differences ın occupational structure and women’s labor force 
participation. 
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researchers interested in the concomitants of occupations’ sex composition 
use the percentage female or male in each occupation as a variable. A measure 
of segregation relevant for understanding the sexes’ differential perceptions 
of change is the likelihood of intergroup contact in the workplace (Jacobs 
1986). To identify the factors facilitating and impeding integration in 
California state agencies, Baron et al (1991:1373) used a measure of gender 
skewness proposed by Theil & Finizza that adjusts for an agency’s overall 
sex ratio, although if one is interested in long-run change, it seerns appropriate 
to ask why an organization’s overall sex composition should be taken as given. 

Most studies of segregation focus on a single characteristic. While sex is 
the paramount basis for job assignment in American society!!, it operates in 
conjunction with race, ethnicity, and age (Smith & Tienda 1988). As Carlson 
(1992:271) reminds us, "sex and race simultaneously structure individuals' 
labor market experiences." Native American women and those of African, 
Asian, and European descent are differently distributed across kinds of work 
(Amott & Matthei 1991, Cassirer 1992, Sokoloff 1992). 


EXTENT AND TRENDS IN SEX SEGREGATION IN THE 
WORKPLACE 


From the time women and men first went out to work, they have done different 
jobs (Goldin 1990). Few occupations employ the sexes in proportion to their 
representation in the labor force, and workers seldom work with persons of 
the other sex in the same establishment, much less in the same location and 
on the same shift (Bielby & Baron 1984). Segregation has been documented 
at every level of economic organization: across economic sectors (Howell & 
Reese 1986), industries (Bridges 1980, Singlemann & Tienda 1985, Lyson 
1986, Wharton 1986, Wallace & Chang 1990), firms (Blau 1977, Bielby & 
Baron 1984), type of employers (e.g. private, public, self-employed—Whar- 
ton 1989), occupations, jobs, job families, and job ladders (DiPrete 1989, 
Strang & Baron 1990) —as well as across branches, shifts, and ranks (Bielby 
& Baron 1984, Reskin & Ross 1992). 

Sex segregation was remarkably resilient over the first 60 years of the 
twentieth century, despite broad economic and social transformations. The 


(Sex segregation within racial groups exceeds race segregation among either sex (e.g King 
1992). 

PPatterns of segregation differ within broad race/ethnic categories For example, the 
occupational distributions of Filipinas and Chinese-Amencan women differ (Cassirer 1992). 
Ignoring race and ethnic differences distorts our understanding of sex segregation and its role in social 
inequality. 

Eor studies of sex segregation across industrial sectors, see Howell & Reese (1986) and Lorence 
(1992); across industries, see Bridges (1980, 1982) and Wallace & Chang (1990), across firms, see 
Bielby & Baron (1984) and Lyson (1985) 
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segregation index computed across detailed occupational categories declined 
slightly if at all between 1910 and 1970. For the entire labor force, the index 
dropped from 69.0 to 67.6 during this interval. If, as Goldin (1990:76) 
claimed, researchers underestimated occupational sex segregation at the turn 
of the. century, segregation has declined more during the twentieth century 
than accepted estimates suggest. 

Men's increased presence in a few female dominated “semi-professions” 
in the 1960s reduced segregation slightly (Blau & Hendricks 1979), but it 
was not until the 1970s that segregation first declined appreciably as women's 
participation grew modestly in several customarily male occupations, and 
disproportionately—compared to the labor force—in a few occupations. As 
a result, by 1980 the segregation index had dropped to 59.8 (Jacobs 1989b). 
Occupational sex segregation declined slightly more among nonwhites than 
whites during the 1970s, as fewer nonwhite women and men pursued heavily 
segregated occupations; among professional workers the decline in segregation 
was limited almost entirely to whites (Beller 1984:20—21). 

According to Current Population Survey (CPS) data, occupational integra- 
tion slowed during the 1980s (Blau 1989). The 1990 index was down about 
two points from 1980; in 1990 out of every hundred women about 57 would 
have had to switch to a majority-male occupation for the sexes to be identically 
distributed across detailed occupations (King 1992).'* Blacks and whites were 
equally sex segregated in 1988, according to CPS data (King 1992). The 
difference between women’s and men’s distribution across industries also 
narrowed during this period for both whites and African Americans (Blau & 
Kahn 1992). The occupations that workers disproportionately entered and left 
during 1981 elucidate the slow pace of change: eight of the ten occupations 
that provided the most new jobs for white men had been at least 80% male 
in 1980, whereas six of the ten occupations that gained the most white women 
had been at least 80% female (Coventry 1992). Nonwhite women and men 
showed similar patterns: with three exceptions the occupations that drew the 
most nonwhite men had been 80% male in 1980, while six of the ten 
occupations in which nonwhite women posted the biggest net increases had 
been at least 8096 female in 1980. Thus, women's unprecedented entry into 
some male occupations was partly offset by the thousands of women and men 
who entered sex-typical jobs during the 1970s and 1980s. Workers’ 


VThe more definitive 1990 census data were not available at the time of this writing. 

I*This pattern will probably continue into the twenty-first century: of the ten occupations 
projected to produce the most jobs between 1990 and 2005, seven were at least two-thirds female and 
two were at least 7596 male in 1980 (Silvestri & Lukasiewicz 1991) The only occupation whose sex 
composition approximately resembled that of labor force was waiter/waitress, and this work tends to 
be quite segregated at the job level. 
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disproportionate exits from sex-atypical occupations (Rosenfeld 1984, Jacobs 
1989a, Coventry 1992) and the resegregation of some customarily male jobs 
as women's work (Blau 1989, Reskin & Roos 1990) have slowed the pace 
of occupational integration. 

Case studies of the occupations in which women—usually white women— 
registered the greatest representational gains in the 1970s and 1980s indicate 
that occupational desegregation did not ensure job-level integration. Intra-oc- 
cupational level segregation confined women to different and typically less 
desirable jobs within feminizing occupations (Reskin & Roos 1990). Thus, 
some of the occupational-level “integration” of the 1970s and 1980s simply 
shifted the locus of segregation from the occupation to the job. Within 
integrated occupations women and men work in different kinds of organiza- 
tions (e.g. public vs private sector, retail vs wholesale, chains vs independents, 
industry leaders vs marginal establishments—Reskin & Roos 1990). Even 
within the same organizations, women are more likely to work in contingent 
roles as part-timers, temporary workers, and contract workers (Callaghan & 
Hartmann 1992). 

These various ways in which a sexual division of labor survives within 
integrating occupations underscore the need for caution in appraising trends 
in segregation: the amount of segregation observed depends on the level of 
analysis as well as the specific population of units studied. In 1985, for 
example, when the index of segregation across all detailed occupations was 
about 58, 48% of 1423 job groups that shared similar work content and a 
common semantic designation within California state government were 
completely segregated (Strang & Baron 1990:492). 

The general patterns of sex segregation described above for the United States 
resemble those in other industrialized countries (Lyson 1986, Rosenfeld & 
Kalleberg 1991b), although the lion’s share of research is on the United States. 
During the 1970s segregation indexes (computed across 14 broad occupational 
categories) for most of 12 industrialized countries were between 40 and 52 
(Roos 1985:51—52). Data for the 1980s for eight industrialized nations reveal 
similar results, although Japan showed less segregation, and the Scandinavian 
countries exhibited more, partly because their family policies encourage 
mothers to work part-time (Rosenfeld & Kalleberg 1991a,b, Charles 1992). 
Jacobs & Lim (1992) found no trend in segregation across 58 countries between 
1960 and 1980, although standardizing the segregation indices to remove the 
effect of changes in countries’ occupational distributions showed declines in 
several countries. In sum, women and men in industrialized societies are less 
segregated at work than in the past, but the majority of each sex still work in 
broad occupational categories in which the other sex is underrepresented or 
absent. Barriers to women’s access to some jobs apparently persist in all 
industrialized countries (Rosenfeld & Kalleberg 1991a). 
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EXPLAINING PERSISTENCE AND CHANGE IN SEX ' 
SEGREGATION 


Explanations for sex segregation are often classified by whether they 
emphasize workers’ (“supply-side”) or employers’ (“demand-side”) actions. 
Supply-side explanations usually focus on female and male workers’ occupa- 
tional choices, while demand-side explanations stress actions by employers, 
including discrimination. Although I use a demand-supply framework to 
organize this review of factors linked to persistence and change in sex 
segregation, it tends to obscure the importance of factors that influence both 
workers and employers. These include sex-role stereotypes and other cultural 
values including occupations’ sex labels, the distributions of occupations and 
industries, government policies and their implementation, and the actions of 
the sex-typical workers. Obviously, occupational sex labels influence both 
employers’ notions about appropriate workers and workers’ ideas about 
acceptable lines of work. Similarly, male workers may inhibit integration both 
by their ability to shape employers’ decisions and by affecting the preferences 
of female workers. Factors that operate on both the supply and the demand 
sides are discussed in the sections on employers as well as workers. 


Employers’ Preferences and Practices 


Employers assign workers to jobs, either directly or through personnel 
practices that govern their internal labor markets; indeed, employers create 
jobs for a substantial proportion of new hires (Granovetter 1974, Rosenbaum 
1985). Therefore employers’ actions contribute substantially to the concen- 
tration of women and men in different jobs (Rosenfeld 1983:638, Bielby & 
Baron 1986b:216, Reskin & Roos 1990). 


EMPLOYERS’ PREFERENCES AND SEX DISCRIMINATION Gatekeepers discrimi- 
nate when they treat people unequally based on an aversion toward one group 
or an affinity toward another that is unrelated to individual performance. There 
is no doubt that discriminatory practices have consigned millions of women 
to sex-typical jobs. Until the mid-1960s the majority of firms had jobs that 
were openly off limits to one sex (Grinder 1961, Strom 1987, Goldin 1990). 
Title VII of the 1964 Civil Rights Act was a turning point. In barring most 
firms with at least 15 employees from assigning jobs on the basis of sex, the 
act made it illegal for employers to impose segregation on workers. Amend- 
ments to Title VII and Presidential Executive Orders in the years that followed 
created a legal foundation for challenging sex segregation. Of particular 
importance was a 1971 Executive Order extending to women the requirement 
that federal contractors take affirmative action to eliminate the effects of past 
discrimination. These laws and regulations subjected discriminating employ- 
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ers to law suits, fines, budget cuts or the loss of federal contracts, and bad 
publicity. The very existence of such rules sends the dual message to 
employers, workers, students, and teachers that a person’s sex should not 
limit their opportunities and that employers who fail to provide equal 
opportunities are suspect. 

The federal Civil Rights Act led the courts to invalidate state laws 
prohibiting women from certain occupations (e.g. tending bar) or certain tasks 
(e.g. lifting more than 35 pounds), although many employers continued to 
cite these no-longer valid laws to justify segregation (Bielby & Baron 1986a). 
Case studies of a few employers or industries under consent decrees (e.g. 
AT&T, the steel industry) showed the integrative effects of regulatory agency 
activity (see Northrup & Larson 1979, Deaux & Ullman 1983). The targeting 
of the banking and insurance industries in the 1970s by the Equal Employment 
Opportunity Commission opened predominantly male jobs in those industries 
to women without dampening women’s dominance of customarily female 
jobs. However, haphazard enforcement limited the direct impact of laws and 
regulations (Reskin & Hartmann 1986:91). When regulatory agencies fail to 
enforce laws, they send a message to employers, condoning segregative 
personnel practices. 

Ultimately, assessing the extent to which anti-discrimination and affirma- 
tive-action laws and regulations curtailed segregation is not possible. In the 
first place, these laws made employers more wary about openly treating the 
sexes differently. Differential treatment has become harder to document, and 
researchers must usually infer discrimination or intervention effects from 
statistical comparisons. Such studies suggest that when anti-discrimination 
laws are enforced, they have contributed to workplace integration (Leonard 
1989, Baron et al 1991). However, the evidence of researchers and litigants 
suggests that discrimination still plays a significant role in perpetuating 
segregation (Levinson 1975, Halaby 1979, Kelley 1982, Blau 1984, Reskin 
& Hartmann 1986, Rhode 1991), surviving because of lackadaisical enforce- 
ment, organizational inertia, and employers' outright opposition to integrating 
jobs. 

-- Just as it is not possible to estimate the extent to which anti-discrimination 
laws have promoted integration, it is equally impossible to determine whether 
the impact of discrimination on perpetuating segregation is declining. Given 
a shift toward a more egalitarian gender ideology during the 1970s (Thornton 
et al 1983) and employers’ sensitivity to community values (Szafran 
1982:185), we might expect discrimination to have declined, but the public's 
ambivalence regarding the desirability of assigning men and women to 
identical kinds of work (Thornton et al 1983), and men's stake in maintaining 
their sex-based privileges (Reskin 1988, Cockburn 1991) suggest that sex 
discrimination is likely to persist as a force in job assignments and promotions. 
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EMPLOYERS' PREFERENCES, SEX STEREOTYPING, AND OCCUPATIONAL SEX 
LABELS Custom contributes to segregation through employers' stereotypes 
about workers and about jobs that produce a sex-specific demand for workers 
for particular jobs. Employers assign the sexes to different jobs in part because 
of beliefs they hold about women and men. Indeed, Bergmann (1986:114) 
contended that a "segregation code" prohibits mixing the sexes as equals and 
reserves upper-level jobs for men. Surprisingly little attention has been paid 
to the effect of employers' gender-role attitudes on their personnel decisions, 
although Reskin & Padavic (1988) found that holding traditional gender-role 
attitudes predisposed supervisors against using women in customarily male 
plant jobs. The evidence for the positive association between female bosses 
and more rapid integration (Baron et al 1991, Kulis & Miller-Loessi 1992) 
may reflect female and male bosses' differential acceptance of gender-role 
attitudes or sex stereotypes. However, evidence regarding the association 
between employers' sex and firms' level of segregation is limited. Given the 
pervasiveness of sex stereotypes 1n our cultural vocabulary, stereotypes are 
available to justify either sex doing many kinds of work, but the stereotypes 
employers cite to justify assigning the sexes to particular jobs (Oppenheimer 
1968:226—27, Bielby & Baron 1986a, Milkman 1987, Reskin & Padavic 
1988) may be post-hoc justifications for segregation (Reskin & Roos 
1990:50—52). 

Occupational sex labels influence job assignments independent of potential 
workers’ actual qualifications (Oppenheimer 1968, Bielby & Baron 19862), 
usually perpetuating—but sometimes reducing—segregation. For example, 
whether the previous incumbent was female and the percentage of female 
educational administrators in a job each increased the probability that a new 
hire would be female (Konrad & Pfeffer 1991). What we label custom is often 
organizational inertia or the "aura of inevitability" (Baron et al 1991:1365) 
that is the legacy of past decisions that took into account workers' sex in 
classifying jobs or setting pay (Bridges & Nelson 1989, Kim 1989, Strang & 
Baron 1990). As a result, large, established civil service agencies displayed 
slower rates of integration between 1979 and 1985 than young (or very old) 
agencies (Baron et al 1991:1386). 

The, customary sex composition of jobs can both retard and accelerate 
women's access to nontraditional work. California state agencies that relied 
on a predominantly female workforce moved toward integration targets more 
rapidly than those with a more male workforce (Baron et al 1991). Similarly, 
women are more likely to enter traditionally male occupations in female- than 
male-intensive industries (Shaeffer & Lynton 1979, Milkman 1987:51, Reskin 
& Roos 199]). As a result, pronounced growth of heavily segregated 
occupations and industries will increase the amount of segregation in a society, 
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and waning segregated occupations and industries will reduce segregation 
(Blau 1989). 


THE DEMAND FOR WORKERS The relative magnitudes of the demand for and 
supply of qualified workers can prevent employers from acting on their 
preferences in filling jobs. An inadequate pool of qualified sex-typical workers 
prompts employers to turn to nontraditional workers. This happens when 
occupational growth outpaces the supply of labor or when other activities such 
as military service draw customary workers out of an occupation’s labor pool. 
As a result, occupational growth, low unemployment, and a shortage of 
qualified male workers have enhanced women’s access to nontraditional jobs 
(Oppenheimer 1970, Skold 1980, Anderson 1982, Abrahamson & Sigelman 
1987, Milkman 1987, Baron et al 1991, Cubbins 1991, Reskin & Roos 1990, 
Lorence 1992:141). Substantial occupational growth can markedly reduce 
segregation. Indeed, an estimated one fourth of the decline in occupational 
sex segregation between 1970 and 1990 stemmed from the growth of 
managerial jobs (Jacobs 1992). Organizational growth is also associated with 
lower segregation in firms and public agencies (Szafran 1984, Bielby & Baron 
1984, Baron et al 1991), although industrial growth during the 1970s was 
not. Occupations in growing industries employed smaller proportions of 
women, while those in declining industries relied increasingly on women 
(Glass et al 1988:264). One indicator of labor supply— unemployment rates 
(either total or male)—Ahas shown inconsistent associations with occupations’ 
sex composition (Szafran 1984:69, Albelda 1986, Abrahamson & Sigelman 
1987, Jones & Rosenfeld 1989:676, Reskin & Roos 1991). Thus, while 
growth is neither necessary nor sufficient for a decline 1n sex segregation, 
most research suggests that growth fosters women's access to male occupa- 
Gons. Research on the conditions under which growth has an integrative effect 
is needed. 


ECONOMIC PRESSURES The costs of hiring, training, and paying prospective 
workers influence employers' personnel decisions, regardless of whether they 
seek to make a profit. All other things being equal, most employers prefer 
the cheapest qualified workers available, so women's lower market wage 
should make them attractive candidates for male- as well as female-dominated 
jobs. In reality, of course, men's higher average pay has not induced very 
many employers to employ women in typically male jobs; countervailing 
forces discussed in this essay intervene. However, employers particularly 
pressed to reduce costs or increase profits should rely more heavily on women 
for sex-atypical jobs. Specifically, highly competitive economic sectors, 
economically marginal industries, and labor-intensive jobs should be less 
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segregated than their opposites. Research regarding this expectation is mixed. 
Supporting the hypothesis are findings of greater segregation in capital-inten- 
sive than in labor-intensive industries (Bridges 1980:66, Milkman 1987), in 
nonservice industries than the service sector (Lorence 1992), and in core than 
periphery firms (Lyson 1985, Reskin & Roos 1991). Also, the more 
concentrated in the South manufacturing jobs were, the more female an 
industry’s workforce (Wallace & Chang 1990). But other studies cast doubt 
on this hypothesis. Jobs in monopolistic industries and core firms were less 
segregated than in more competitive sectors (Bielby & Baron 1984); manu- 
facturing industries that enjoyed greater market concentration and hence could 
afford higher-priced workers nonetheless were more likely to employ women 
than those with less market concentration (Wallace & Chang 1990); manu- 
facturing industries with fewer resources segregated blue-collar workers more 
than resource-richer settings (Wharton 1986); and California public agencies 
made headway in integrating jobs despite property-tax limits that drastically 
cut state revenues (Baron et al 1991:1394). Longitudinal designs and 
firm-level research are needed to disentangle these inconsistent findings 
regarding employers' resources, their economic power, and their propensity 
to segregate workers. 

Some scholars have hypothesized that employers with slack resources or 
who are buffered from pressures to maximize profits (e.g. the public sector, 
monopolies, and industries with cost-plus government contracts) can indulge 
a taste for discrimination in assigning workers to jobs (Stolzenberg 1982, 
Szafran 1982, Cohn 1985:18—22). Although there is some historical support 
for this reasoning (e.g. Cohn 1985, Milkman 1987:68), administrative 
oversight and political forces constrain how employers use discretion (Szafran 
1982). Recently external forces have imposed costs on employers who 
segregated the sexes. Anti-discrimination regulations of the 1960s and early 
1970s made it illegal for employers to exclude workers from any jobs because 
of their sex. Two federal agencies—the Equal Employment Opportunity 
Commission and the Office for Federal Contract Compliance Programs—as 
well as various state agencies are charged with enforcing discrimination bans 
and affirmative-action programs for federal contractors, Thus, political 
pressure on public employers and private government contractors to practice 
equal employment may offset their insulation from economic pressures. 
Indeed, public agencies and federal-contract holders tend to be less sex 
segregated than the private sector and noncontractors (Salancik 1979, Beller 
1982, Wharton 1989). Employers' responsiveness to external pressures is seen 
in greater declines in segregation in agencies dependent on federal funds, as 
well as those sanctioned by the California State Personnel Board, and those 
in which outsiders rather than an agency's own affirmative-action personnel 
oversaw staffing activities (Baron et al 1991:1394). However, the integrative 
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effects of enforcement efforts and the threat of litigation appear to be short-run 
(Beller 1982:390, Deaux 1984, see also Rosenbaum 1985), which is not 
surprising given the limited federal enforcement activity in the 1980s 
(Burbridge 1984). 

Any factor that reduces the difference in the cost or profitability associated 
with employing female versus male workers may affect employers’ propensity 
to employ one sex rather than the other. If employing women raises rather 
than lowers employers’ total wage bill, the economic advantage of hiring 
women disappears. Research points to several factors that can mitigate the 
price savings of female labor. The first of these is equal-pay-for-equal-work 
Tules. Before World War II, most equal-pay claims among automobile and 
electrical manufacturing workers came from men objecting to women’s 
assignment to previously male jobs, and the result usually was a restoration 
of a sexual division of labor that preserved the pay gap between the sexes 
where rules required equal pay for equal work (Milkman 1987:47, 43). Indeed, 
Milkman (1987:147) concluded that the United Auto Workers’ successful fight 
for equal-pay during the war contributed to management’s post-war preference 
for male labor. 

Another factor that can offset the integrative effect of women’s lower pay 
is male workers’ actions. If male incumbents insist on a wage premium to 
work with women (Bielby & Baron 1986a), sabotage women’s productivity 
(Bergmann & Darity 1981, Cohn 1985), or disrupt production by threatening 
strikes or striking (Milkman 1987:43, Rose 1988), they can nullify any savings 
integration might yield. 

Another factor that can mitigate the effect of women’s lower pay is the 
response by customers. Women’s lower pay will not win them jobs if 
employers believe that they will drive away customers. Some employers have 
cited customer discrimination to justify barring women from certain jobs, but 
the courts have ruled that customers’ preferences are not a legitimate business 
reason for sex discrimination. No studies have systematically established the 
extent to which customers’ preferences for workers of a particular sex have 
contributed to the persistence or decline of segregation. However, case studies 
of male occupations that feminized after 1970 illustrate integrative effects of 
customers’ perceived preferences. To cite two examples, corporate recognition 
of the growing power of the purse brought more jobs for women public-re- 
lations specialists, and insurance companies hired female sales agents to serve 
a prowing female market (Reskin & Roos 1990:77). However, a female market 
is not sufficient to create a sex-specific demand for women. For example, the 
“women-for-women” campaign failed to open embalming jobs to women at 
the end of the last century (Rundblad 1992), and although women reportedly 
buy two out of every five cars, almost 90% of car and boat salespersons were 
male in 1990. More research is needed to unravel the conditions under which 
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customers’ preferences contribute to the perpetuation or reduction in job 
segregation. 


Statistical Discrimination 


Many employers use sex as an inexpensive screen to fill jobs, given beliefs 
that one sex is more productive at certain tasks or more likely to justify 
investments in on-the-job training (Phelps 1972). Employers’ reservations 
about women may be based on presumed differences in strength or tolerance 
of adverse working conditions or the belief that women’s domestic roles will 
lead to excessive absences and turnover rates. To the extent that employers’ 
assumptions about the sexes lead them to assign individual men and women 
to different jobs, they are discriminating “statistically.” Regardless of whether 
statistical discrimination is based on actual sex differences or invalid stereo- 
types (i.e. “error discrimination" —Aigner & Cain 1977, England & McCreary 
1987, Bielby & Baron 1986a), it usually contributes to sex-typed job 
assignments. For example, employers segregate men into jobs with physical 
demands and women into jobs demanding social skills (Bielby & Baron 1984, 
Farkas et al 1991). However, employers’ use of sex in job assignments exceeds 
technical or economic justifications: within the “mixed-sex” occupations that 
either sex could presumably perform, small differences in job requirements 
were accompanied by large differences in sex composition (Bielby & Baron 
1986a:782). 


EMPLOYERS’ PERSONNEL PRACTICES A growing literature attests to the 
importance of organizations’ personnel practices on their degree of segrega- 
tion. Personnel practices are the formal and informal procedures that comprise 
firms’ external and internal labor markets. How employers recruit workers 
for entry-level jobs affects the sex composition of those jobs. Reliance on 
informal networks through which employees tell acquaintances about openings 
tends to perpetuate segregation because social networks tend to be segregated 
(Braddock & McPartland 1987). In contrast, formal recruitment procedures 
such as advertisements facilitate integration (Szafran 1982, Roos & Reskin 
1984). 

Internal labor markets can preserve segregation (Baron & Bielby 1985, 
Hartmann 1987), especially when entry-level jobs are in segregated job 
families. As a result, the higher a Job is on the ladder, the more skewed its 
sex ratio (Baron & Bielby 1985).'° Even a gender-neutral promotion system 
can contribute to segregation, as in the New York civil service where the 
primary stumbling block to women and minorities in higher-level jobs was 


I5Baron & Bielby (1985) noted that this pattern 1s also consistent with organizations allowing men 
but not women to circumvent formal promotion rules 
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their underrepresentation in jobs eligible for promotion (Steinberg et al 1990, 
see also DiPrete & Soule 1988). 

Other critical features of internal labor markets that can preserve—or 
reduce—segregation include seniority, rules, transfer policies, required cre- 
dentials that are less common among one sex, and job-posting practices 
(Kelley 1982, Deaux 1984, O’Farrell & Harlan 1984, Remy & Sawers 
1984:107-8, Roos & Reskin 1984, Milkman 1987:37). Establishments have 
integrated the sexes by altering their hiring and promotion practices. Effective 
modifications include seniority overrides, “bridges” connecting clerical lad- 
ders to professional, technical and administrative ladders, elimination of 
unnecessary job requirements that women tend to lack, and expanded job 
posting (Northrup & Larson 1979, O'Farrell & Harlan 1984, Baron et al 
1991). Restructuring internal labor markets is especially effective when 
job-assignment procedures and mobility paths are highly formalized (Baron 
et al 1991:1393). Of course, sex-neutral promotion policies cannot eliminate 
segregation unless entry-level positions are integrated (DiPrete & Soule 1988). 
Moreover, organizations are not likely to make wholesale changes in their 
personnel procedures without pressure from watchdogs or organizational 
constituents, a situation that further impedes integration. For example, Bridges 
& Nelson (1989:651) found that agencies and unions were less likely to 
intervene actively for wage adjustments on behalf of state workers in mixed 
or female jobs. Baron and his colleagues’ (1991:1395) findings for California 
state agencies further establish the importance of coalitions inside and outside 
agencies for modifying career outcomes and personnel practices (see also 
DiPrete 1989). Perhaps this helps explain why the effects of changes appear 
to be limited to targeted jobs (Baron et al 1991:1393—94). 

Among the most segregated and the slowest to integrate are small 
entrepreneurial firms and large, bureaucratic establishments (Lyson 1985, 
Baron et al 1991). However, the effects of size are complicated and the 
evidence inconsistent (Wallace & Chang 1990). To some degree, size is 
confounded with enforcement pressures because small firms are exempt from 
some regulations and because enforcement agencies have targeted large firms 
(Reskin & Hartmann 1986). Also, large corporations’ publicized commitment 
to equal opportunity may generate pressure from female employees to expand 
career opportunities. However, growth makes integration easier, and efforts 
to implement affirmative action programs have been especially effective in 
enhancing women’s access to some customarily male jobs in growing 
occupations (DiPrete 1987, Reskin & Roos 1990, Baron et al 1991). 

From a theoretical standpoint, bureaucracies should be less segregated than 
nonbureaucratic enterprises. Some research bears out this expectation. Orga- 
nizations that centralize personnel decisions have more balanced representa- 
tion by women and blacks (Szafran 1982:181, Baron et al 1991). Government 
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jobs, which are both bureaucratized and vulnerable to political pressure 
(DiPrete 1987), are less segregated than those in the private sector (Wharton 
1989). However, Bielby & Baron (1984) observed more segregation in large 
bureaucracies than in smaller establishments. In part this reflects the large 
number of job titles in large bureaucracies, but the more extensive segregation 
they observed in bureaucracies is not solely an artifact of the number of job 
titles. Using 1985 data for California state agencies, Strang & Baron (1990) 
found that within integrated occupations, job-title proliferation by rank, 
functional specialty, and organizational location led to job categories that were 
more segregated than in the parent “job stem” (job groups with similar work 
content and titles). These results are consistent with agencies’ using job 
proliferation to preserve sex segregation. 

The slow pace of integration, even in organizations under external pressure 
to reform, suggests that structural alterations in employers’ hiring and 
job-assignment practices are often limited in extent or effectiveness (O’ Farrell 
& Harlan 1984, Baron & Bielby 1985, DiPrete & Soule 1988, Baron et al 
1991). Hampering integration in such settings are organizational inertia, filling 
jobs through informal networks or based on seniority, the seniority gap 
between the sexes, low rates of turnover, job loss, and economic stagnation. 
Even in organizations committed to reform, reforms geared to reducing sex 
inequality can be subverted by the people charged with implementation 
because they wish to maintain the status quo or achieve altogether different 
goals (Steinberg 1987, Reskin & Padavic 1988). 


Workers’ Preferences and Actions 


THE SUPPLY OF WORKERS The supply of qualified workers of each sex sets 
limits on occupations’ sex composition (Reskin & Roos 1990). Repeatedly 
throughout recent history, a shortage of male workers has opened the doors 
of customarily male occupations to women (Strober 1984, Strober & Arnold 
1987, Reskin & Roos 1990). The effect of female labor supply 1s less clear. 
Some evidence suggests that the more women in a local labor market who 
are in the labor force, the less segregation—at least for some occupations 
(Abrahamson & Sigelman 1987, Lorence 1992), but others (Semyonov & 
Scott 1983, Jones & Rosenfeld 1989:687) have reported contradictory results. 
Of course, for women to profit from a shortage of men, a pool of qualified 
women must exist. The supply of educated women in a labor market is 
negatively related to its level of segregation (Abrahamson & Sigelman 1987, 
Jones & Rosenfeld 1989), and a large number of already trained women has 
facilitated women’s entry into such feminizing occupations as accounting, 
book editing, and systems analysis (Reskin & Roos 1990). 

Most supply-side theories of segregation assume that women and men 
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“supply” themselves to different occupations, and the remainder of this section 
reviews the evidence for this assumption. 


WORKERS’ PREFERENCES Both the socialization and neoclassical-economic 
perspectives on segregation contend that workers’ occupational outcomes 
reflect their preferences. Socialization theories emphasize the different pref- 
erences the sexes develop before reaching adulthood, while neoclassical 
economists assume that in order to maximize their lifetime earnings, the sexes 
select different occupations, given their differing adult roles. 


Neoclassical economic explanations of segregation Neoclassical economists 
trace sex segregation to women’s expectations that family obligations—espe- 
cially childrearing —will limit their market work (Mincer & Polachek 1974, 
Polachek 1981).! | 

Many researchers have investigated whether sex differences in education, 
training, and experience contribute to women's underrepresentation in certain 
jobs. The sexes are identical in median years of education, although men are 
less likely to finish high school. Men are also more likely to complete 
postgraduate study, but women are catching up. Moreover, predominantly 
female and male occupations demand simular levels of schooling and cognitive 
skills (England et al 1982). At the level of the local labor market, women’s 
median education is negatively associated with the level of segregation 
(Abrahamson & Sigelman 1987), as are the proportion of female college 
graduates and women’s share of male professions (Jones & Rosenfeld 
1989:681). At the individual level, attending college increased a woman’s 
rate of moving to a majority-male occupation (Rosenfeld 1983:647), and while 
higher education did not affect female high school graduates’ movement 
between sex-typical and sex-atypical jobs, it was associated with their staying 
in a predominantely male occupation when they changed jobs (Rosenfeld & 
Spenner 1992). These results are consistent with a human-capital explanation 


l7Neoclassical writers tend to assume women’s greater involvement in child rearing rather than 
trying to explain it Women’s expected domestic roles hypothetically (1) disincline them from 
investing ın education, training, and experience, thereby reducing their qualifications for 
customarily male jobs, and (11) predispose them toward occupations that do not penalize the skill 
depreciation that may occur during periods away from the labor force, are easy to re-enter, and let 
women conserve their energy for their domestic obligations. 

lŝMale and female students’ college majors, although different, have been converging (the 
segregation index was 51.4 1n 1964, and 35 m 1984—Jacobs 1989a:116). Moreover, the link 
between college major and the sex-typicality of subsequent employment has not been established, 
although Reskin & Roos (1990) argued that the large number of women with degrees in accounting, 
business administration, computer science, and pharmacy provided a supply of prospective workers 
when demand pressures prompted employers to turn to women. 
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for women’s underrepresentation in occupations that require advanced edu- 
cation, but they are equally in keeping with the possibility that women must 
overqualify themselves to compete successfully for male jobs (Reskin & Roos 
1990). 

Few researchers have assessed the effect of vocational training on women’s 
access to sex-atypical jobs. Jobs filled predominantly by females require and 
provide less training (England et al 1982, Bielby & Baron 1986a), but this 
may reflect employers’ insistence that workers in traditionally female occu- 
pations bear the cost of their training, as well as the limitations the educational 
system, employers, and unions have imposed on women’s opportunity to 
obtain training, especially training geared specifically to traditionally male 
jobs (e.g. Deaux 1984:294, Reskin & Hartmann 1986). 

The human-capital approach to segregation holds that male and female 
occupations require different skills and that women avoid occupations that 
demand skills that depreciate while they are out of the labor force raising their 
children (England 1982, 1984). Two issues obscure our understanding of the 
relationship between training, skills, and the sex composition of jobs. First, 
sex bias in conceptualizing and measuring skill vitiates the concept's 
explanatory power (Reskin 1988, Steinberg 1990). For example, male workers 
have been more successful in enforcing the definition of their jobs as skilled. 
Second, both workers’ sex and the sex composition of occupations affect 
workers’ chances to acquire skills and their access to skilled jobs. Particularly 
in the crafts and the professions, men's control of training has enabled them 
to prevent women from acquiring skills. Thus, the causal ordering of the 
association between sex, occupational sex composition, and occupational 
skills is far from clear. This important topic deserves careful study. 

Most of the evidence on the effect of women's labor-force attachment is 
inconsistent with the neoclassical account of segregation that holds that 
because women anticipate intermittent employment, they opt for occupations 
whose starting pay is high and which do not penalize intermittency. In fact, 
discontinuous employment had little effect on women's income (Corcoran et 
al 1984), and the penalty for skill depreciation was unrelated to an 
occupation’s sex composition (England 1982). Although there is some 
evidence that female jobs have lower returns to seniority than male jobs (Filer 
1983), the findings that female-dominated occupations have lower starting 
pay than male occupations and that women's earnings do not appreciate any 
faster in predominantly male than in predominantly female occupations 
(England et al 1988:554) cast doubt on the neoclassical account. 

Whether sex differences in work commitment contribute to segregation is 
a separate question. Women average fewer hours per year than men (Reskin 
& Hartmann 1992), disproportionately female occupations are more likely to 
employ workers part-time and part year (Beller 1982, Rosenfeld 1983, 
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Cubbins 1991), and reducing one’s hours from full to part-time is associated 
among young women with a move to a mostly female occupation (Rosenfeld 
& Spenner 1992). However, both demand- and supply-side causal orderings 
of the link between occupations’ sex composition and part-time work are 
plausible: women who seek part-time work are presumably drawn to jobs that 
offer it—i.e. female occupations, but employers sometimes structure jobs that 
they have labelled as female as part-time (Reskin & Roos 1990:58). Other 
job characteristics of typically female jobs are not particularly compatible with 
family roles: for example, job flexibility and unsupervised breaks when 
workers can call home were less common among female than male occupations 
(Glass 1990:791). Moreover, although women were more likely than men to 
value jobs with characteristics that mothers would find convenient, women 
with small children were less likely to endorse such values (Walker et al 
1982:27). 

In view of these results, it is not surprising that between married and 
single women and between women with and without children differences 
in occupational sex type often run counter to neoclassical predictions: The 
probability of working in a nontraditional occupation increased with women’s 
number of children (Beller 1982:383); being married did not affect the 
sex-typicality of women’s job moves (Rosenfeld 1983); single women were 
slower than women with young children to leave female for male jobs 
(Rosenfeld & Spenner 1992); and motherhood predisposed women toward 
transferring to predominantly male plant jobs in the same firm (Padavic 
1991). However, rotating shifts which wreak havoc with childcare arrange- 
ments disinclined women from plant jobs (Padavic 1991). Women with 
children are more likely than other women to do paid work at home, but 
both female- and male-dominated occupations employ significant numbers 
of homeworkers (Kraut & Gramsh 1987). Finally, two studies cast doubt 
on the hypothesis that women opt for female jobs because they demand 
less effort or commitment: Bielby & Bielby (1988) found that female 
respondents reported slightly more effort than their male counterparts: net 
of household responsibilities, human-capital measures, and job rewards, 
women invested substantially more effort to work than comparable men (p. 
1048). Using 1991 data, Marsden et al (1992) found no significant sex 
difference in organizational commitment; indeed, women were slightly more 
committed to their employer than men who were similar on family ties, job 
attributes, and career variables. 

In summary, despite the intuitive appeal of the human-capital explanation 


In 1991, 10.5% of employed men worked part-time, compared to 25.696 of women (US Dept. 
Commerce 1992. Table 626) About one quarter of women work part-time because they cannot get 
fuli-time jobs (Applebaum 1987) 
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of sex segregation, its assumptions lack empirical support and many have 
been disconfirmed. 


Gender-role socialization and occupational aspirations Differences in the 
sexes’ socialization may contribute to their concentration in different occupa- 
tions by (i) fostering a bent toward specific occupations labelled as appropriate 
for their sex or an aversion for those jobs defined as inappropriate, (11) creating 
a taste for or against working conditions common in sex-typical or sex-atypical 
jobs, (iii) teaching only those skills needed in sex-typical occupations, and 
(iv) providing information only about sex-typical jobs. In part because of 
socialization, many young people enter the job market with preferences and 
traits consistent with sex segregation, although less so than in the past (Marini 
& Brinton 1984, Subich et al 1989). Moreover, the traditionality of young 
women’s occupational preferences is associated with their expected commit- 
ment to the labor force (Waite & Berryman 1985), although a study of women 
in one company found that women’s sex-role attitudes did not affect their 
interest in transferring to a predominantly male plant job (Padavic 1992). 
Researchers have established a link between high school girls’ aspirations and 
their mothers’ work patterns. Girls whose mothers worked in blue-collar jobs 
preferred traditionally male occupations, and living in a female-headed 
household slightly predisposed girls toward a male occupations (Waite & 
Berryman 1985). Other role models presumably also affect young women’s 
aspirations. The convergence of the sexes’ college majors and young women’s 
increased knowledge of women pursuing nontraditional careers have probably 
enhanced their interest in sex-atypical jobs (Goldin 1990). 

Female high school seniors’ emphasis on homemaking or work was linked 
to their adult jobs. Women who, as students, had expressed stronger 
orientations toward work were slower to leave majority-male occupations, 
and those who expected to hold high-status jobs were quicker to abandon 
predominantly female occupations for male ones (Rosenfeld & Spenner 1992). 
While traditional gender-role socialization may predispose some women to 
customarily female jobs (see Subich et al 1989 for a review of the literature), 
perceptions about what jobs are open to them also shape workers’ preferences 
(Reskin & Hartmann 1986). 

Studies of the stability of young people’s occupational aspirations and their 
link to their adult occupations speak to the impact of gender-role socialization 
on sex segregation. Jacobs (1989a) has shown that young people’s occupa- 
tional aspirations are weakly associated with either their later aspirations or 
their actual occupations, replicating results indicating frequent migration 
between sex-typical and sex-atypical occupations. For example, Rosenfeld 
(1984) found that about 30% of women who changed jobs moved from an 
occupational category dominated by one sex to one dominated by the other 
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(also see Corcoran et al 1984). Such moves produce low correlations between 
the sex composition of successive occupations (Jacobs 1989a, Rosenfeld & 
Spenner 1992). However, the studies of mobility across sex-typical and 
sex-atypical occupations share a common weakness. Because they classify as 
sex-typed all those jobs in occupations with more than an arbitrary proportion 
of one sex, some moves between sex-typical and sex-atypical occupations do 
not necessarily involve a move between sex-typical and sex-atypical jobs. 
Nonetheless, the instability of people's occupational aspirations calls into 
question the long-run impact of pre-employment preferences for adults’ 
segregation into different kinds of work. Instead many workers apparently 
respond to the available opportunities, regardless of their sex labels. Consistent 
with the operation of a sex-differentiated opportunity structure that mitigates 
the effects of people’s pre-labor-market preferences is the finding that young 
men’s occupational aspirations were lower than the occupations they held at 
age 30, while young women settled for less desirable occupations than they 
had imagined as youths (Rindfuss et al 1992). 


Job opportunities and occupational aspirations A variety of intervening 
factors that affect people's jobs weakens the link between the sex-typicality 
of workers' aspirations and their actual jobs. Barriers and opportunities modify 
workers’ preferences and constrain the jobs they obtain, channeling most 
workers into sex-typical jobs. Factors that signal workers’ access to occupa- 
tions and industries include their sex labels and firms' hiring practices reflected 
in the sex makeup of their workforce (Heilman 1979, Reskin & Hartmann 
1986). Thus, women expressed greater interest in sex-neutral jobs that were 
depicted with female rather than male workers (Bem & Bem 1973). 
Perceptions of barriers and opportunities affect workers' aspirations; actual 
barriers and openings affect workers' employment in sex-typical and sex-atyp- 
ical jobs. The deterrent effect of required apprenticeships, licenses, or 
credentials (Roos & Reskin 1984, Cubbins 1991) operates partly by commu- 
nicating to women that jobs are closed to them (Reskin & Hartmann 1986, 
Steinberg et al 1990:284). 

Men's exclusionary actions can influence women's preferences by indi- 
cating that women are welcome or unwelcome in men's jobs (Schroedel 
1985, Cockburn 1988, Jacobs 1989a, Rosenfeld 1983). The desire to protect 
their occupational turf can lead workers to close ranks against outsiders 
(Hodge & Hodge 1965) or to make interlopers less attractive to employers 
by sabotaging their work (Bergmann & Darity 1981). Male incumbents’ 
ability to exclude women depends on their power. Several proxies for men’s 
power are negatively linked to women’s share in an occupation, including 
men’s average job tenure (Reskin & Roos 1990), the proportions who are 
white, speak English, and are prime-age, and their median educational level 
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(Bridges 1982, Reskin & Roos 1991). Although empirical studies cast doubt 
on how frequently men’s actions actually drive women out of jobs (O’ Farrell 
& Harlan 1982, Waite & Berryman 1985, Padavic & Reskin 1990), historical 
studies have traced unions’ exclusion of women from many lines of work 
(Milkman 1980, Rose 1988, Cobble 1990, Roos 1992). For example, the 
Hotel and Restaurant Employees and Bartenders International Union won 
state laws banning women from bartending (Detman 1990). The evidence 
regarding contemporary unions’ effect on sex segregation is less clearcut. 
In the 1960s and early 1970s, California firms under union contract were 
more segregated than their nonunionized counterparts (Bielby & Baron 
1984), but in the late 1970s, unionized California manufacturing plants 
displayed no more sex or race discrimination than nonunionized plants 
(Leonard 1984). Other research suggests that the unions’ effect on segre- 
gation varies (Bridges 1982:287, Wallace & Chang 1990, Baron et al 1991, 
Reskin & Roos 1991) and cannot be easily summarized. For example, the 
level of unionization in an occupation was positively related to women’s 
representation in occupations that required little training, but negatively 
related to women’s representation in occupations that required more than 
two years of training (Cubbins 1991). The higher levels of segregation 
sometimes seen in unionized establishments may partly reflect unionized 
jobs’ emphasis on seniority and their lower turnover rates that retard the 
pace of integration (Leonard 1984) as well as the proliferation of job titles 
resulting from union contracts (Baron 1992). However, unions differ on 
their positions on equal employment and their response to the competitive 
threat women might pose. An alternative to excluding them is insisting that 
employers pay women the same as men for equal work (Milkman 1980), 
thereby reducing women’s attractiveness to employers as cheaper labor. 
These inconsistent findings point to the need for systematic study of the 
conditions under which unions foster or impede equal access to jobs. 

Of course, men’s stake in excluding women depends on their attachment 
to an occupation and its desirability. Case studies of male occupations that 
have feminized suggest that employers turn to women after men have 
abandoned an occupation. From these case studies Reskin & Roos (1990) 
argued that occupations’ sex composition changes in response to changes in 
the job queues and labor queues that the labor market comprises. As male 
workers eschew deteriorating jobs for more desirable opportunities, the labor 
shortage forces employers to descend lower 1n the labor queue to hire women. 
In other instances, employers reordered the labor queues for customarily male 
jobs in response to pressures that made women workers more attractive. 
Analyses of the 1970~1980 changes in occupations’ sex composition are 
consistent with this queueing perspective (Reskin & Roos 1992), as is Jones 
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& Rosenfeld’s (1989) finding that men competed with women for predomi- 
nantly male white-collar jobs but not for low-ranked heavily female white- 
collar jobs. 

A variety of societal and economic changes have enhanced women’s 
perceived access to men’s jobs. The women’s liberation movement challenged 
traditional norms that had consigned the sexes to different kinds of work, and 
state and federal regulations barred discrimination and sometimes mandated 
sex equity in education, vocational training, apprenticeship programs, and the 
workplace (Reskin & Hartmann 1986). These political and ideological changes 
were reinforced by demographic and economic forces—the growth in female- 
headed households and the decline in families’ real earnings—that tied women 
more permanently to the labor force. Thus, during the same time women’s 
need for “male” incomes grew, society gave women permission to pursue 
nontraditional lines of work, and employers faced pressures to hire workers 
without regard to their sex. 


Workers’ values and occupational aspirations, and job characteristics Re- 
search generally fails to support the hypothesis that the sexes’ work values 
orient them to different occupations. The sexes are more similar than different 
in their work values, although research on sex differences in work-related 
values has produced mixed findings. Several studies from the early 1980s 
indicate that men value autonomy, authority, and opportunities for promotion 
more than do women (see England 1992:19 for a review), although these 
results may reflect women’s adaptation to the low-authority dead-end jobs. 
Other studies show no sex differences in valued job characteristics (Jencks et 
al 1988, Loscocco 1990, Walker et al 1982). The latter studies imply that 
customarily male jobs should attract both sexes, and they do (Reskin & Roos 
1990). Women in blue-collar jobs preferred the intrinsic and extrinsic job 
rewards more available to their male counterparts (Loscocco 1990), and young 
women in the National Longitudinal Survey expressed no special preference 
for characteristics common in female occupations and were willing to work 
for lower pay in exchange for characteristics associated with male jobs (Reed 
& Holleman 1988). 

Some job characteristics that women are more likely than men to value are 
not unique to predominantly female jobs; indeed, some are more common in 
male jobs. In addition, women do not necessarily realize their preferences in 
predominantly female jobs, nor do various values influence what jobs they 
accept. For example, women like clean conditions and good collegial relations 
(Loscocco 1990, Padavic 1992), but these preferences were unrelated to their 
inclination to transfer to a male plant job (Padavic 1992). Although the need 
for physical strength and exposure to job hazards are more common in 
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disproportionately male occupations (Cubbins 1991, Jacobs & Steinberg 
1990), on average, women's jobs are no cleaner than men's jobs and are 
noisier (Reed & Holleman 1988, Jacobs & Steinberg 1990), and workers in 
predominantly male occupations are more likely than those in female 
occupations to say their work is easy (Glass 1990:792). 

A particular attraction of male jobs is their higher pay. The economic 
advantages of predominantly male jobs have been exhaustively documented 
(Treiman & Hartmann 1981, Marini 1989). The higher pay and better benefits 
of male jobs (Perman & Stevens 1989) are particularly attractive to women 
supporting families (Padavic 1992), whereas women in female jobs that pay 
as well as men's jobs are less drawn to male jobs (Padavic & Reskin 1990, 
Rosenfeld & Spenner 1992). Customarily male jobs also provide better 
promotion opportunities (Rosenbaum 1985:130, Glass 1990, Steinberg et al 
1990) and are more likely to confer authority (Jacobs & Steinberg 1990, 
Reskin & Ross 1992), all job attributes that women value. 


DISCUSSION AND CONCLUSION 


As is true in many industrialized countries, occupational-level sex segregation 
has declined more since 1970 than in the previous 60 years. However, the 
pace of integration slowed in the 1980s, and in the early 1990s few occupations 
had sex compositions resembling the labor force. Although many exceptions 
exist, at the job level sex segregation generally remains high. We must be 
cautious in inferring workplace integration from declines in occupational 
segregation. Not only has the drop in occupational segregation masked 
job-level segregation (Reskin & Roos 1990), but as Jacobs (1989b) has shown, 
the door to sex-atypical jobs is a revolving one: more women passed through 
this door after 1970 than in the past, but various forces shunted many of them 
back into traditionally female jobs. 

Social and economic forces both perpetuate and reduce segregation. 
Traditional gender-role socialization has inclined young women and men 
toward different occupations, but its direct contribution to sex segregation is 
probably small. Young people's occupational aspirations fluctuate consider- 
ably before they enter the labor force and bear a weak relationship to the jobs 
they later hold. The sex-typicality of workers' jobs is also unstable, reflecting 
the facts that it is employers who assign workers to jobs and that workers 
respond to opportunities that present themselves. In view of the fact that both 
sexes value many features that are more characteristic of male than female 
jobs, men's monopoly of these more desirable jobs more likely reflects their 
ability to command better jobs than women's active pursuit of less attractive 
jobs. The slowing decline of segregation in the 1980s, despite women's 
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increased experience and attachment to the labor force, their convergence 
with men’s aspirations and college majors, and their growing economic need 
to work, further attests to the institutionalized forces that channel women into 
customarily female jobs. 

Most discussions of sex segregation treat the disparities in the rewards that 
predominantly male and predominantly female jobs offer as consequences of 
segregation that establish its legitimacy as a social problem. Viewing 
segregation as a fundamental process of social inequality alerts us to their 
causal role in perpetuating sex segregation. Women’s concentration in 
different jobs than those that employ men is one of the most dramatic 
expressions of sex differentiation (Reskin 1988). That most women work in 
jobs that pay less than men’s jobs pay and do not lead to authority over men 
is fundamental for maintaining a system of sex inequality in which many men 
have a stake (Hartmann 1976, Reskin 1988, Cockburn 1991). 

Recognizing segregation's role in the system of sex stratification informs 
an agenda for future research. We need to know more about how workers 
end up in specific jobs—both in terms of their own actions and those of 
employers. A stratification perspective on segregation points to the need for 
studies of the organizational, economic, and individual factors that condition 
male workers’ response to women’s entry. More generally, studies of workers 
that take into account the structural constraints that affect workers' actions 
should be fruitful. On the demand side, we need research on the role of 
employers in creating jobs, formulating requirements, making job assign- 
ments, linking jobs on ladders, and promoting workers. Studies of 
organizations' personnel policies and practices have illuminated and can 
elucidate these issues. On the supply side, we need to know about the factors 
that influence workers' moves into and out of sex-atypical jobs, including the 
effects of family members, and how these may vary for persons from different 
racial/ethnic groups and economic circumstances. At the occupational level, 
we need to know the contribution made by various groups of workers—new 
labor-market entrants, movers, and retirees—to changing sex composition. 
Studies that recognize the interrelationship between individuals’ social and 
economic circumstances, organizations' structure, resources, and history, and 
employers' actions will bring us closer to understanding persistence and 
change in segregation. 
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Abstract 


This review focuses on formal theories and models of collective action. There 
are many types of collective action, and they cannot all be captured with the 
same formal model. Four types of models are reviewed: single-actor models 
which treat the "group" behavior as given; models of the interdependent 
aggregation of individual choices into collective action; models of the 
collective decisions of individuals with different interests; and models of the 
dynamic interactions among collective actors and their opponents. All models 
require simplifying assumptions about some aspects of a situation so that 
others may be addressed. Models of the aggregation of individual choices 
have shown the greatest recent growth, have employed a wide variety of 
assumptions about individual behavior and coordination mechanisms, have 
identified complex interaction effects of group heterogeneity, and generally 
exhibit thresholds, discontinuities, and internal group differentiation. Models 
of dynamic interactions require further development but promise to be 
enriched by accumulating empirical time series data on collective events. 
Greater attention is urged to technical issues of formal symbolic mathematical 
analysis, experimental design, response surface analysis, and technical 
problems in the reduction and presentation of complex interactions. 


INTRODUCTION 


Formal collective action theory has undergone an enormous growth and 
elaboration within the past few years, with a major shift from focusing on 
individual decisions to focusing on group structure and interaction. This theory 
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draws on and speaks to the larger empirical literature on collective action and 
social movements, but mostly in indirect ways. Only for the very simplest 
individual decision models have there been empirical studies that could be said 
to test predictions. This article centers on the newest formal theory, with brief 
reviews of older formal theories and empirical data on individual decisions. ! 1 

This review begins with a brief summary of the historical grounding of 
collective action theory in Mancur Olson’s work, and of the way the problem 
has been reformulated by subsequent theorists. It provides abbreviated reviews 
of public goods economics and of models of individual decisions which treat 
the “group” as given, including a sketch of some of the empirical findings 
that reflect on such models. The core of this review discusses models that 
treat collectivities rather than individuals as the units of analysis. A brief 
conclusion points to the complex and contingent nature of the newest findings 
and shows how some patterns are emerging amidst the complexity. 

The process of preparing this article involved a close review of the 
mathematics, but it is impossible within its space limitations to present 
equations and derivations. Results are also usually too complex to be 
adequately summarized. Instead, words are used to say very generally what 
the models do, with some emphasis on strategies of analysis and presentation. 
I was able to check some of the simpler models in a spreadsheet; in all cases 
I could duplicate the reported results, and several times found additional 
patterns in the data, which I report. The process of the review also led me to 
reflect upon principles and standards of formalizations and simulations, which 
I discuss briefly at the end of the article. 


Mancur Olson and The Logic of Collective Action 


To the uninitiated, the term “collective action" would seem to have at its 
definitional core the idea that people do something together, and recent 
scholarship is beginning to return to this idea. But since the late 1960s, most 
literate social scientists have seen the core definitional content of the term 
"collective action" as a common or shared interest among a group of people. 
This view is derived from Mancur Olson's The Logic of Collective Áction 
(1965), especially his assertion that "rational, self-interested individuals will 
not act to achieve their common or group interests" (1965:2) and a chapter 
which offers a mathematical "proof" of this assertion coupled with a persuasive 
verbal description of the "free rider problem." 


OLSON'S CONTRIBUTION Prior to Olson, social scientists generally assumed 
that there was a natural tendency for people with shared interests to act together 


lFor a recent review of empirical research on "collective events," (which are a subset of the 
phenomena understood as “collective action") see Olzak 1989. 
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in pursuit of those interests, that is, that there was an unproblematic congruence 
between individual interests and group interests. Olson argued otherwise. 
Economists had long argued that rational individuals would not voluntarily 
contribute money to pay for public goods such as armies, legislatures, parks, 
public schools, or sewage systems, thus explaining why coercive taxation is 
necessary. Olson argued that all group goals or group interests were subject to 
the same dilemma. He defined a collective good as one which, if provided to 
one member of a group, cannot be withheld from any other member (called 
nonexcludability or “impossibility of exclusion;” Hardin 1982, p. 16).? 
Collective action was thus defined as any action which provides a collective 
good. Although this formal definition does not exclude individual actions, and 
the individual provision of collective goods is an important phenomenon, most 
scholars focus on actions that are behaviorally collective as well. 

Olson argued that if the benefits of a collective good cannot be withheld 
from nonparticipants, rational individuals are motivated to free ride on the 
contributions of others. Furthermore, he argued, this temptation would be 
greater the bigger the group, where the benefits of a contribution would have 
to be divided up among more people, and any one person's contribution would 
be less likely to make a noticeable difference in the outcome. Thus, he said, 
collective action is "irrational." If he had been a sociologist, Olson might 
have used this argument to launch a theory of the nonrational or nonindividual- 
ist bases of collective action. But since he was an economist, he argued that 
collective action must be accompanied by private excludable selective 
incentives that reward participants or punish nonparticipants, and he devoted 
the rest of his book to defending the empirical claim that such selective 
incentives can be found in a variety of historical instances of collective action. 

The importance of Olson's argument to the history of social science cannot 
be overestimated. Prior to Olson, social scientists typically assumed that 
people would instinctively or naturally act on common interests, and that 
inaction needed to be explained. Explanations typically took two forms. In 
one, the connection between interest and action was taken as so automatic 
that 1naction itself was taken as proof that there must not actually be a 
collective interest. In the other, inaction was explained in terms of individual 
"apathy" (which was, of course, indicated by the failure to act) or by some 
sort of communal deficit (of organization, solidarity, education, or resources) 
which prevented people from acting on their interests. 

After Olson, most social scientists treat collective action as problematic. 


?In the early work of Samuelson (1954), Head 1974) and others, “pure” public goods are 
defined by both nonexcludability and "jointness of supply." In later work, jointness of supply 
tends to be treated as a variable, regardless of the formal definition used. For a good summary 
of this issue see Hardin 1982 (pp. 17-20) 
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That is, they assume that collective inaction is natural even in the face of 
common interests, and that it is collective action that needs to be explained. 
This is the central problem for the “resource mobilization” (e.g. McCarthy & 
Zald 1973, 1977) and “political opportunity” (e.g. McAdam 1982) theories of 
social movements. (See also Snyder & Tilly 1972, Gamson 1990, Tilly 1978, 
Oberschall 1973, and Jenkins 1983 for greater elaboration of these theories than 
is possible here.) Instead of viewing “grievances” as automatically generating 
action, these theories stress resources, organizational capacities, and shifts in 
the polity as central to determining which grievances get acted upon. 


THE PROBLEM WITH OLSON'S PROBLEM The development of formal collective 
action theory begins with a critique of Olson. Plausible as it may seem, Olson's 
argument is wrong on its own terms. Selective incentives cannot logically 
solve the collective action dilemma. The problem is that somebody has to pay 
for the selective incentive, and paying for a selective incentive is, itself, a 
collective action in that it provides a benefit to everyone interested in the 
collective good, not just the people who pay for the incentive. Thus, the free 
rider problem adheres just as much to providing selective incentives to induce 
others to provide collective goods as to the original collective action problem 
(see Frohlich & Oppenheimer 1970, Frohlich et al 1975, Oliver 1980). If 
collective action were truly always irrational, then selective incentives could 
not solve the problem. 

More generally, Olson's equations and his verbal arguments entail a number 
of implicit assumptions, and it is the truth or falsehood of those assumptions 
that makes collective action rational or irrational. Different critics have focused 
on different assumptions and have explained the problems and contradictions 
in different ways (see Frohlich & Oppenheimer 1970, Frohlich et al 1975, 
Oliver 1980, Chamberlin 1974, Schofield 1975, Bonacich et. al, 1976, Smith 
1976, Hardin 1982, Oliver, Marwell & Teixeira 1985, Oliver & Marwell 
1988, Kimura 1989). A fair summary would be to say that there are many 
situations that give rise to "Olson's problem," but the scope of this problem 
is narrower than most people would understand by the phrases "collective 
action" or "collective goods." All theorists would predict with Olson that 
unconnected self-interested individuals will not spontaneously make small 
contributions which make no detectable difference in the provision of an 
expensive collective good, even if the sum of their contributions would benefit 
everyone. What the critics say is that, contrary to Olson's arguments, 
interdependence and coordination can change individual decisions even 
without private incentives, and that many collective goods can, in fact, be 
provided by a small number of individuals making large contributions through 
an appropriate technology (e.g. lobbying Congress). 

Olson's "group size" argument—that collective action is more unlikely in 
larger groups—has also been subject to extensive critique. (See Hardin 1982 


MODELS OF COLLECTIVE ACTION 275 


for a review of these debates.) In the formal collective action literature it has 
been repeatedly argued that this claim is either tautological or wrong. Olson 
and his critics agree that the real issue is whether an individual can make a 
noticeable difference in the provision level of a collective good. Olson 
confuses the issue: first he virtually defines a large group as one in which 
contributions are not noticeable (1965, p. 44), but then in a footnote he rejects 
the tautology and offers the empirical claim that noticeability is very unlikely 
in a larger group (1965, p. 49). It is this empirical claim that has been 
challenged. Nonexcludable collective goods usually have high levels of 
jointness of supply, in which the cost of the good is invariant with the number 
who enjoy it. When jointness of supply is high, there is no simple relation 
between the size of the group and the noticeability of contributions or the 
prospects for collective action (Bonacich et al 1976, Mitchell 1979, 1980, 
Hardin 1982, Oliver & Marwell 1988). 

Because this issue has become quite confused in the literature, it may be 
worthwhile to elaborate. Put simply, in some situations the group size effect 
will be negative, and in others positive. You have to know the details of a 
particular situation before you can know how group size will affect the 
prospects for collective action. If you are creating a model, vou have to put 
a group size effect into the model to get one out. If the theorist assumes that 
costs increase or individual benefits decrease with group size, there wil] be a 
negative “group size" effect on action. If the theorist assumes that the good 
has high jointness of supply, so that costs and benefits do not vary with group 
size, there will either be no "group size" effect, or it will be positive. 

Sometimes theorists build implicit group size effects into their models. A 
negative group size effect is forced whenever an individual's payoff is defined 
as something divided by group size, either as a proportion of a total share or 
as a function of the proportion of cooperators in a group. By contrast, defining 
the individual payoff as a function of the raw number of cooperators builds 
in a positive group size effect by implicitly assuming jointness of supply. In 
more complex models, positive or negative "group size effects" are often 
implicit in the details of a coordination or sanctioning system. 


The Purpose and Project of Formal Collective Action Theory 


The discussion of the "group size" effect implies a more general point. The 
most important result of twenty years of formal collective action theory is that 
collective action is not a unitary phenornenon. That is, the range of events 
reasonable social scientists subsume under the term "collective action" is much 
too complex and diverse to allow simple generalizations about its causes, 
effects, or dynamics. Titles like The Logic of Collective Action (Olson 1965) 
or The Mathematics of Collective Action (Coleman 1973) have been deeply 
misleading, even though their contents are illuminating. Questions like "Do 
people free ride?" and "Is collective action rational or irrational?" are too 
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general to guide research. More useful questions are: When do people free 
ride? Under what conditions is collective action rational? How is collective 
action coordinated? What factors affect outcomes of strategic interactions 
among collective actors? What are the important types of collective action? 

Collective action theorists increasingly view their task as using formal tools 
to illuminate processes and dynamics within particular classes of collective 
action. Implicitly, they are also helping to construct a typology of collective actions 
by identifying crucial factors that determine forms and trajectories of action. 

As formal theorists have moved beyond Olson's problem into the theoretical 
space it opened, they have implicitly returned to the older conception of 
collective action as something people do together. That is, although the 
existence of a collective interest is taken as the context within which collective 
action is studied, theoretical attention is devoted less to the interest itself as 
causative, and more to the social and organizational processes that make action 
possible. Most formal theories of collective action have some sort of individual 
decision by rational cost/benefit calculus at their core, but the question of 
individual versus group interest is not the major focus of their attention. A 
few major theories have no "interest" concepts at all, and instead employ 
cognitive processing or adaptive learning models. 

Models of collective action operate at different levels. The simplest and 
oldest focus on one individual at a time, and ask when that individual will 
act or contribute to some form of collective action. They collapse "the group" 
into a generalized other or a simple payoff equation. Many theoretical and 
empirical articles have addressed the conditions that affect individuals' choices 
in collective action. This review can only briefly indicate what is known about 
the individual predictors of participation. 

More complex models are concerned with the joint action of many 
individuals within one collectivity. In these models, the problem 1s whether 
individuals will be willing and able to coordinate their actions into a single 
joint action. For most scholars working in the area, this is the problem evoked 
by the phrase "collective action." Scholars have explored variations in 
heterogeneous distributions of important traits across individuals, structures 
of network linkages among them, and coordination mechanisms. Although 
most of these models work ın the cost/benefit paradigm, some instead are 
grounded in adaptive learning or cognitive processing psychologies. Most 
recent theoretical breakthroughs involve single collectivity models, and this 
review gives greatest attention to them, even though they are too new to have 
generated much empirical research. 

Another set of models focusing on a single collectivity may be called 
"collective decision" models. In these, the collectivity contains individuals 
with many interests who can engage in many collective actions, and the 
question is, which subset of actions will be jointly chosen by the members. 
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This problem of “legislative bargaining” in political science is beyond the 
scope of this review. However, the few sociological models within this 
tradition are reviewed; again there is little relevant empirical research. 

Finally, some models address not the actions of one collectivity at a time, 
but the strategic interactions between collectivities, particularly between 
movements and their opponents, usually conceived as states or regimes. These 
models ignore most within-collectivity complexities to permit consideration 
of interaction and strategy. These models have the greatest immediate potential 
of being linked to empirical research. 

These four levels of analysis are all simplifications of a complex process 
which includes all of them at once. We might imagine a long-range goal of 
putting all levels together in one enormous theoretical model, something 
comparable to physical science simulations of global weather patterns or the 
global environment. However, enormous models like these are still not 
producing reliable results for physical scientists, who are building on a much 
larger base of well-understood simple principles. Scholars of collective action 
need theory development and empirical research before contemplating grander 
models. We accept trade-offs. We simplify some problems to address tae 
complexities of others. 


INDIVIDUAL DECISION MODELS 


Models of a single actor collapse “the group” into an undifferentiated other 
in a payoff structure. Olson (1965) employs a single-actor model, and many 
such models are reformulations of Olson’s. Hardin (1982) provides a 
comprehensive review of these models through the late 1970s, discussing a 
wide variety of dimensions of collective action with special emphases on the 
“asymmetries” of action and the formation of implicit contractual solutions 
through conventions and norms. 

Since Hardin (1971) analyzed Olson’s problem as a prisoner’s dilemma, 
many scholars have used game theoretic payoff matrices to debate the likely 
outcomes of collective action, including those who argue it is not a prisoner's 
dilemma but a coordination or assurance game (e.g. Runge 1984). AII 
payoff-matrix accounts of collective action treat the "group" as a unitary actor 
and summarize the problem as eight payoffs in a matrix. In practice, payoff 
matrices are used to illustrate verbal arguments, rather than as analytic tools 


3Shofield (1975) uses characteristic functions to address Hardin's problem and shows that 
collective action should become unanimous if a coalition favoring action forms For recent debates 
on the utility of game theory for understanding collective action, see the special issue of Rationality 
and Society, Vol 4, No 1, 1992. This article also does not discuss the coalition-bargaming game 
tradition with its use of characteristic functions For overviews of recent theory and research cn 
game theory, see Kahan & Rapoport (1984), Komorita (1983), Michener & Potter (1981), 
Myerson (1991), Owen (1982), and Shubik (1982). 
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A great variety of individual decision models have organized much theory 
and research on collective action. In general, these provide an equation for 
the net payoff of participating in collective action as a function of the benefit 
of the collective good, the benefit of “selective incentives” and the costs of 
participation. They differ greatly in the details of the specifications of these 
benefits and costs. Authors rarely manipulate these equations mathematically 
to produce derivations or new results, but instead use them heuristically to 
organize a term by term verbal discussion of the determinants of participation. 

Although most economists and also critics of rational choice models (e.g. 
Gamson 1990) assume that individual decision models require “objective” 
costs and benefits, social scientists who use the paradigm are actually 
grounded in the psychological model of subjective expected utility, where all 
terms are assumed to be those subjectively perceived by actors. Klandermans 
(1984) calls for explicit use of the subjective expected utility model, with 
explicit recognition that these subjective terms change over time and can be 
influenced by intentional communications. Opp (1986, 1989) argues that 
subjective models are not tautological, because reliable survey techniques 
exist for measuring attitudes. Debates about whether subjective utility models 
are the best representations of the ways individuals think are beyond the scope 
of this review (e.g. Roemer 1978, 1979, Ferree 1992, Gamson 1992). 


Empirical Data and Individual Decision Models 


Individual decision models of collective action have had a great impact on 
empirical research. The level of interest in the collective good is most 
commonly operationalized with attitude scales measuring the intensity of 
opinion about a collective issue such as Cruise missiles (Klandermans & 
Oegema 1987), an upcoming union contract (Klandermans 1984), neighbor- 
hood problems (Oliver 1984), radiation exposure from Three Mile Island 
(Walsh & Warland 1983), or nuclear weapons (Opp 1988). Such scales 
typically have the predicted strong positive relation to measures of participa- 
tion, although they do not always distinguish token members from active 
participants (Oliver 1984). 

These theories generally have some term for efficacy, the perception that 
one's actions make a difference in accomplishing goals, which is the hope 
and urgency that marks historic moments of peak collective action (e.g. 
McAdam 1982). Research generally finds that participants in movement 
activities are more optimistic than nonparticipants about the prospect of change 
and about the efficacy of their participation, including African Americans 
between 1930 and 1970 (McAdam 1982), riot participants of the 1960s 
(Forward & Williams 1970, Paige 1971, Seeman 1975) and European social 
movement protests in the 1980s (Finkel et al 1989, Klandermans 1984, Opp 
1988). 
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In theoretical models, the potential for free riding is most commonly 
captured by multiplying the individual’s interest in the collective good by a 
term capturing the probability that the individual’s contribution will make a 
difference in the collective good. Empirically, the multiplicative model rarely 
yields a significant improvement over a linear model including the interest 
and efficacy terms.* Empirical research also suggests that while perceived 
collective efficacy is often important, the individual free riding dilemma is 
not. Walsh & Warland (1983) identified people who had an opinion about 
restarting the Three Mile Island reactor but were not participating in a group 
contesting the issue. They found few self-conscious free riders who were 
letting others do the work; most gave as reasons for nonparticipation their 
criticisms of the organizations, not having been asked to join, or lack of time. 
In a series of natural experiments, Marwell & Ames (1979, 1980, 1981, also 
Alfano & Marwell 1980) generally found that average “free riding” levels of 
about 50%; free riding was higher when subjects thought they were in larger 
groups. Self-reported individual efficacy levels are often highly implausible. 
Opp (1989) finds that movement participants attribute to themselves efficacy 
they believe the whole movement has. 

Olson’s concept of “selective incentives” has received important elabora- 
tion, with attention to the individual incentives that help to reward participation 
or punish nonparticipation. Oliver (1980) demonstrated formal differences 
between rewards and punishments as incentives. Most scholars today follow 
Clark & Wilson (1961) and James Q. Wilson (1973) in recognizing three 
broad types of incentives: material, solidary, and purposive. Material incen- 
tives are discussed by Olson and include salanes, insurance programs, and 
threats of physical or economic retaliation. Solidary incentives arise from 
social relations with other participants, such as praise, respect, and friendship 
or shame, contempt, and ostracism. Purposive incentives arise from internal- 
ized norms and values in which a person’s self-esteem depends on doing the 
right thing. These concepts of solidary and purposive incentives have 
permitted theorists to incorporate the influences of social networks and culture 
and socialization. Direct measures of solidary and purposive incentives have 
the expected positive relations (Opp 1988, Klandermans 1984, Klandermans 
& Oegema 1987, Knoke 1988). Research indicating the strong effects of 
networks and socialization on collective action also can be construed as 
evidence for solidary and purposive incentives (see Snow & Oliver 1993). 

“Cost” terms in individual decision models are more problematic. Higher 


^I base this claim on my own analyses and personal communications with other analysts. 
These nonsignificant results are rarely published, and most publications use linear models. One 
exception is Opp, who often publishes the multiplicative models on theoretical grounds and a 
comparison of R^s 
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costs should reduce action, but the data do not consistently show this pattern. 
Macro-level analyses of repression, resources, and political opportunities 
support a cost-benefit account, i.e. that there is more collective action when 
costs are lower and potential benefits are higher (McAdam 1982, Tilly et al 
1975, Tilly 1978, Oberschall 1973). But measures of subjective costs do not 
have the expected negative relation to action. Hirsch (1990) found that 
participants in a campus divestment protest believed they were bearing heavy 
costs and making sacrifices, while nonparticipants downplayed costs and 
assumed that participants were gaining intrinsic benefits. Opp (1988, 1989) 
found a similar pattern regarding assessment of costs and risks associated with 
antinuclear protest activity. Relatedly, people with more money or free time 
should have lower opportunity costs, but wealthy people give a lower 
proportion of their income to charity than poorer people (Sugden, 1984, 
Bergstrom, Blume & Varian 1986, Andreoni 1988, 1989, 1990), and people 
who are already busy are more likely to participate in a new activity (Oliver 
1984). 

We may summarize the empirical data about individual decisions by saying 
that cost/benefits models have predictive value when the right costs and 
benefits are included, but different costs and benefits seem salient in different 
circumstances, and no one has offered a coherent theory to predict these 
variations. 


PUBLIC GOODS ECONOMICS 


Economists and some political scientists investigate problems similar to 
collective action with the standard methods of economics, including plotting 
indifference curves and identifying equilibria, patterns of collective action 
which are stable in the sense that no individual would be motivated to depart 
from it in isolation. There is an enormous literature in the economics of public 
goods provision that cannot be reviewed here, but see Buchanan (1968) and 
Tullock (1974), Boadway & Wildasin (1984), Boadway & Bruce (1984), 
Johansson (1991) and Cornes & Sandler (1986). Sociologists should note that 
there are many “types” of public goods investigated with different models. 
Economists have explored the equilibrium solutions for different production 
functions with varying assumptions about interdependence and coordination 
(e.g. Cornes & Sandler 1983, 1984a,b, 1985a,b, 1986, Bergstrom & Comes 
1983, Cremer & Riordan 1985, Buchanan & Pinto Barbosa 1980) including 
weakest-link and best-shot rules (Hirschleifer 1983), provision of public goods 
by single individuals such as dragon-slayers (Bliss & Nalebuff 1984), contracts 
(Bagnoli & McKee 1991), matching rules (Guttman 1978), negotiation 
processes (Mas-Colell 1980), and profit-seeking political entrepreneurs 
(Frohlich, Oppenheimer, and Young 1971, Frohlich & Oppenheimer 1974, 
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Bagnoli & Lipman 1989); Laver (1980) critiques "political" solutions. There 
are models of optimal central taxation or allocation schemes for public goods 
(e.g. Groves & Ledyard 1977, 1980, Lau, Sheshinski & Stiglitz 1978). In 
the theory of clubs, individuals join together to provide a nondivisible but 
excludable good (Hillman 1977, Buchanan 1965, Ng 1973, Helpman & 
Hillman 1977, Zech 1982); Smith (1985) applies club theory to environmental 
lobbies. 

Economists also model pure altruism. They vary in whether people value 
only others’ happiness (Sugden, 1984, Bergstrom, Blume & Varian 1986), 
or whether they are also motivated by the “warm glow" of knowing they did 
the right thing (Andreoni 1988, 1989, 1990). These issues are explored with 
data on rates of giving by income, and the extent to which individual giving 
is crowded out by public welfare. This debate parallels collective action 
theory's distinction between valuing a collective good and the purposive 
incentive of doing the right thing about pursuing the good. 


COLLECTIVE DECISIONS 


For the problem of resolving competing interests within a group, Coleman 
(1973) develops a matrix model for what may be called collective decision 
making. He defines vectors of actors and outcomes, and matrices for each 
actor's control over and interest in each outcome, where actors lack full 
control over outcomes that interest them. Matrix multiplications represent 
bargaining or exchange processes that determine whose interests are 
achieved, the general principle being that control over events in which 
others are interested yields the greatest power. Standardization rules are 
applied to express interest and control as proportions of appropriate totals. 
Each outcome is assumed to have a probability of occurrence proportional 
to the total amount of power directed toward that event. Coleman (1988, 
1989) extends this mode of analysis to other problems, particularly to the 
"second order problems" of sanctioning and norm creation systems, showing 
(in line with Heckathorn's results, below) that such systems can produce 
zeal, the overprovision of collective goods. 

Based on his prior work on community influence systems (Marsden & 
Laumann 1977, Laumann, Marsden, and Galaskiewicz 1977), Marsden 
(1981) adds influence processes and networks to the basic model by creating 
a standardized actor-by-actor matrix to capture the extent to which each actor 
modifies his expressed interests in light of the interests of another. He shows 
that contradictory interests lead control or resources to be held out of the 
system. He also investigates simple patterns of interaction among influence, 
control, and interest distributions, showing how these affect the total level of 
resource mobilization in the system and the status and power of actors. 
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Several new models address the problem of interactions between collectivities, 
especially between movements and regimes. An important line of recent 
empirical work stresses interaction of movements and states in shaping cycles 
of protest. Simple early works include McAdam’s (1983) demonstration that 
peaks of protest activity cluster around tactical innovations, and Pitcher et 
al’s (1978) formal model of collective violence. Both argue that an innovative 
protest first catches the regime off guard, and thus diffuses because of its 
success, and then is brought under control by more effective social control 
or repression. McAdam’s (1982) political process model has never been 
formalized but is expressed as a diagram of causal flows and feedback loops 
which could be modeled. McAdam presents data to support his model. Recent 
work (e.g. Koopmans 1992) focuses on empirical evidence that recent 
European protest cycles begin with confrontative nonviolent tactics which 
increase in both moderation and institutionalization on the one hand and 
radicalism and violence on the other, a pattern best explained by shifts in the 
state's response to confrontative nonviolent actions. 

There are several recent formalized attempts to capture the complexities of 
dynamic interplay between movements and regimes. All are promising but 
flawed by problems of metric or specification. However, because there are 
also significant bodies of empirical data about collective events over time, 
this type of model might generate testable empirical propositions that could 
take metricized inputs from empirical cases. 

Chong (1991) develops a supply and demand model based on McPhee's 
(1966) model of consumer demand. Popular mobilization both expresses the 
demand for public goods and creates new demand; mobilization may stimulate 
opponent mobilization. The government responds to mobilization by supply- 
ing policies which may favor either proponents or opponents. Chong's regime 
is never repressive, only responsive to opponents. All three actors (proponents, 
opponents, government) have baseline tendencies to act which are modified 
by the actions of others. Factors captured by the model include bandwagon 
rates (mobilization due to prior successes), contagion rates, reactive mobili- 
zations, and differential regime sensitivity to proponent and opponent mobi- 
lization. There are extensive substantive discussions of the rationale for each 
relationship in each equation. However, many of these discussions explicitly 
argue for nonlinearities and interaction effects, while Chong's equations are 
all linear. Also, the specification of the model implies that groups are 
homogeneous. Chong does not address the question of how these simplifica- 
tions affect his results. 

Chong's effort is also marred by inattention to metric or normalization. 
Mobilization is in units of people and cannot have the same metric as 
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government policies. In the model, policy variables are always multiplied by 
coefficients which can bring them into the scale of mobilization, and Chong 
always works with these products in his examples, thus sidestepping the metric 
issue. It would be preferable, however, to address it directly so that his models 
could be more directly interpreted. 

Despite these problems, Chong's work is important. His mathematical 
sophistication is greater than most. He sets up his models symbolically and 
uses the mathematics of difference equations to solve for equilibrium 
conditions and time paths. Thus the parameters themselves are interpreted as 
variables which jointly define qualitatively different structural conditions 
which produce different patterns of results. Some structural conditions produce 
stable equilibria, others produce oscillations, and others produce spirals to 
infinity. These are potentially powerful results which should be pursued with 
nonlinear specifications of relations and specific treatment of the issue of 
metric. 

Opp (1991) models the interaction between state actors and collective 
actors, and explores the effects of a sudden grievance and the distributions 
of the original propensity to act among the actors. Opp's model is designed 
to capture the pattern of his empirical findings about the effects of repression 
in the German peace movement (Opp & Roehl 1990), in which the net 
effects are positive, due to the indirect “radicalizing” effects on incentives, 
despite the negative direct effects of cost on action. Unfortunately, it is 
impossible to determine the long-term utility of the model because its present 
form is fatally flawed. For no apparent reason, the range of costs is 15 
times that of benefits, thus making most net payoffs negative, a problem 
Opp deals with by fiat, declaring in defiance of theory that some negative 
net payoffs will lead to action. Slope coefficients for equations are similarly 
chosen arbitrarily. Opp's only comment on the choice of parameterization 
is that the model behaves differently depending on variable ranges and 
parameters, and that he picked those that seemed to him to give "plausible 
results" in terms of the time path of the model, i.e. those which fit his 
single empirical case. Though flawed in execution, this general model may 
be worth pursing more rigorously. 

Hoover & Kowalewski (1992, also Kowalewski & Hoover 1992a,b) focus 
on modeling the link between dissent and repression, arguing persuasively 
from a review of 101 empirical studies that states cannot be conceived as 
static "targets" and that feedback loops known to exist ın the relations make 
cross-sectional analysis misleading. In their models, movements and states 
have stocks of resources and grievances which are increased or decreased 
depending on their own actions, those of their opponent, and those of third 
parties. They distinguish between the scope and intensity of action, which 
seems similar to empirical findings about the difference between widespread 
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moderate action and violence committed by a radical cadre (e.g. Koopmans 
1992). 

They explore several key variations. The first is the nature of the feedback 
between movement actions and state actions, whether it is positive, negative, 
or zero and whether actors respond to intensity, scope, or an average of the 
two. They report that all their models converge rapidly to stable equilibria, 
and that positive feedbacks (mutual sensitivity) produce the highest levels of 
ongoing conflict (Hoover & Kowalewski 1992), and that being imperturbable 
to the opponent reduces it. They model the response of these equilibrium 
conflicts to external “shocks,” including war (Hoover & Kowalewski 1992), 
various conflict resolution interventions (Kowalewski & Hoover 1992a), and 
shifts in political opportunities (Kowalewski & Hoover 1992b). They also 
argue that both the state and the movement can be internally divided. They 
construct a typology of distinct orientations actors might have, show how 
different orientations lead to different strategic choices in different contexts, 
and identify the likely overall patterns according to how many of the 
orientation types prefer each strategy (Kowalewski & Hoover 1992b). 

Hoover & Kowalewski control metrics by setting all variables (grievances, 
resources, and actions) to the same range, 0 to 100, and interpreting their 
values as percentages of the maxima. However, all parameters in the model 
are specific numerical linear coefficients, most of which seem quite arbitrary 
and a few of which seem wrong. No attempt is made to assess the sensitivity 
of the model to changes of parameters, and all reported results are constrained 
by these choices. 


SINGLE COLLECTIVITY MODELS 


The greatest theoretical elaboration has occurred among models of the ways 
individual actions interdependently aggregate into a collective action. All these 
models assume a unitary collective action and single dimensions of interest 
(or reinforcement) and resources. They vary in assumptions made about 
individual choices, and in the structures of interdependence among actors. 


Critical Mass Theory 


Oberschall (1980, 1989) provides simple models for the effects of group 
heterogeneity on protest waves, but most of the recent emphasis on hetero- 
geneous groups arises from critical mass theory which emphasizes the role of 
actors who behave differently from typical group members (Oliver et al 1985, 
Oliver & Marwell 1988, Marwell et al 1988, Prahl et al 1991, Marwell & 
Oliver 1993). Oliver et al (1985) show how the shape of the production 
function defines different "types" of collective action with different dynamics. 
'They argue that strategic gaming is the potential problem for decelerating 
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production functions, while efficacy is the central problem for accelerating 
functions. They assume a common metric for costs and benefits, and they 
develop formal representations of concepts of surplus, order effects, indirect 
production, and contracts. Most results are analytic; numerical examples are 
taken as fixed, and the sensitivity of results is not assessed. 

Oliver & Marwell (1988) use verbal argument and one numerical example 
to show that Olson's "group size" claims are not genera] but contingent, and 
that most collective goods have high jointness of supply which produces a 
positive group size effect, although they refer to more complex analyses they 
do not report. 

Marwell et al (1988) investigate the interactions among group heterogeneity 
(both in interests and resources) and organizing cost, network density and 
network centralization, using an accelerating production function and an 
organizer-centered mobilization of an all or none contract. Findings are: (i) 
increasing heterogeneity around a given resource or interest mean improves 
success rates when the mean would produce failure and increases failure when 
the mean would produce success; (i1) organizing costs and network density 
have the predicted negative and positive effects, respectively, on collective 
action; (1ii) when groups are heterogeneous, network centralization increases 
the rate of collective action by increasing the probability that an organizer 
will be tied to a few large contributors. 

The core of the analysis is an experimental design and a Monte Carlo 
simulation. There are 6 possible values each for the two heterogeneity terms, 
10 for costs and density, and 19 for centralization which taken together define 
a6 x 6 x 10 x 10 x 19 design with 68,400 cells. Since it was impossible 
to generate all possible combinations of parameters (2,794 cases were 
generated across several months' time), parameters were themselves randomly 
chosen from uniform distributions across their ranges, thus yielding a 
representative random sample of the full design. A further random component 
is the generation of heterogeneous groups with the indicated heterogeneity 
and network parameters. Output from the simulation is analyzed with standard 
regression techniques. 

Oliver & Marwell (1991, also Marwell & Oliver 1993, Ch. 6) investigate 
the effects of information on the total contribution an organizer is able to 
mobilize. The problem is explored with two different models, each hinging 
on the fact that the mean of some top fraction of a distribution must be higher 
than the overall mean of the distribution. In both models, the cost of 
information is a reduction in the total number mobilized. Organizers must 
choose between having less information and mobilizing more people un- 
selectively, or having more information and mobilizing fewer people more 
selectively. Formal derivations are coupled with a variety of numerical 
examples. Noting that relative costs can dwarf other findings, the authors 
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report the general patterns that information is worth more as group heteroge- 
neity increases, and that there is an optimum information level which increases 
as group heterogeneity increases. 

Prahl et al (1991) explore the trade-offs between reach and selectivity in 
recruitment campaigns. Using an organizer-centered model and an accelerat- 
ing production function, they develop an equilibrium equation for the expected 
total contribution from a heterogeneous group with a given mean, and show 
how this expected contribution varies as the parameters vary. All results 
exhibit multidimensional thresholds that must be exceeded before collective 
action can occur. There 1s a critical point above each threshold where 
contributions rise either steeply or discontinuously with small increases in any 
factor producing large effects. All factors approach asymptotes above the 
threshold, and analytic expressions are derived for the asymptotes. Resources 
and interest have continually positive effects, while increases in the interest 
level after the threshold add nothing to the outcome. 


Sanctioning Systems 


The general issue of “second order” collective action and sanctioning systems 
has been extensively analyzed by Heckathorn (1988, 1989, 1990, 1991) in a 
complex formal model which takes off from Oliver (1980) and Coleman 
(1988). Heckathorn is particularly concerned with situations in which an 
external control agent imposes collective sanctions on a group, so that if any 
group member defects, all are punished. Absent the sanction, all group 
members prefer to defect rather than cooperate, and they do not harm each 
other by doing so. The question is whether the external sanction leads group 
members to impose internal sanctions on each other to force cooperation. The 
short answer is that sanctions can cut either way: group members may either 
enforce compliance or use their sanctions to enforce rebellion and resistance 
to the external control agent. 

Heckathorn's initial work (1988) uses static equations for one actor's 
decision at a time, to determine an equilibrium outcome under a given set of 
conditions, while later versions (1990) have set up complex algorithms of 
sequential decisions and iterations to equilibrium. The earlier work inputs a 
propensity to defect and calculates the value of using sanctions to prevent 
defection. The later work begins with universal defection and models actors 
actively choosing whether to use their sanctioning to enforce compliance or 
rebellion. 

Heckathorn's conclusions are complex and contingent. Interactions among 
sanction strength, group cohesion, and the mix of individual and collective 
sanctions determine whether a group is indifferent to the external agent, 
compliant, or rebellious. He finds divisions of labor within groups. Group 
cooperation often arises through "hypocritical compliance," using sanctions 
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to make others comply while defecting oneself, until there are enough 
sanctions to make everyone cooperate. Many groups retain this division of 
labor in equilibrium: some members cooperate with the external agent, while 
others bear the cost of the sanctions to enforce their cooperation. 

Heckathorn's (1990) model is a complex iterated sequence of decision 
algorithms, and he gives few analytic results for group outcomes. Numerical 
examples of different regions of the response surface are coupled with verbal 
descriptions of the complex interactions and dynamics and some reports of 
the model's sensitivity to initial conditions. He assumes a common metric of 
costs and benefits, and he devotes some attention to determining how the 
ratios of the various costs and benefits affect the results, although not to the 
ways sanctioning costs might depend on others' behavior. 

Heckathorn's (1988) analysis of sanctioning systems is organized around 
identifying the factors custodial agents might control. He reports that the 
stability of the compliance system is positively related to the strength of the 
external sanction, negatively related to the cost of control, and positively 
related to the degree of control among group members. Heckathorn also argues 
in this article that very small groups provide the best sanctioning systems. 
My own reanalysis of his equations replicates his reported results but also 
shows that the optimum group size goes up as the risk of a detected sanction 
goes down, especially when it is below 0.1. In this case, the conclusions 
about "group size" would differ if a broader range of conditions had been 
explored. 

Addressing similar issues with payoff matrices, Heckathorn (1991) dis- 
cusses the "altruist's dilemma" game in which egoists produce collectively 
rational outcomes, but altruists who try to maximize the other's payoff jointly 
produce collectively irrational outcomes. Heckathorn's best example is the 
"tragedy of the lawns," in which altruism or sanctions leads to a collectively 
irrational escalation in gardening standards. 


Stochastic Learning Models 


Macy (1989, 1990, 1991a,b) has explored the effects on collective action 
models of replacing the forward-looking rational actor with the backward- 
looking stochastic learner. Sometimes he argues that adaptive learning 1s a 
superior model and other times that it is an alternate model which can 
sometimes be more appropriate. Rational decisions are determinate one-shot 
choices that require intelligent processing of often-complex information about 
costs and benefits. Stochastic learning requires many repeated choice oppor- 
tunities but less intelligence, and additionally assumes that behavioral choices 
are not determinate, but probabilistic functions of the rewards and punishments 
of past actions. Macy's learning model is rooted in a conventional Bush- 
Mosteller stochastic learning model for binary choice (Bush & Mosteller 
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1955), and the work of game theorists and psychologists in applying this 
model to rats playing the prisoner's dilemma (Rapoport & Chammah 1965, 
Flood et al 1983, Gardner et al 1984). Miller & Andreoni (1991) also use an 
adaptive learning model to create an evolutionary game theory analysis of 
free riding experiments, which they argue better explains the observed 
experimental results. Macy consistently identifies analytic solutions and 
reports on the stability of his results under varying parameterizations. 

Macy (1990) applies his model to Oliver & Marwell’s treatment of the 
critical mass, using an S-shaped logistic production function and varying 
the jointness of supply. Unlike most collective action models where 
nonaction has a payoff of zero, Macy makes the baseline provision level 
itself a variable, so that escaping collective misery can be compared with 
collectively improving on an already good situation. In this model, the 
aversive consequences of the privativistic baseline leads actors to experiment 
with prosocial behavior. With low jointness of supply, these experiments 
are not rewarded and thus not repeated, but when jointness of supply is 
high, the stochastic process virtually assures a moment when enough people 
try cooperation to cross the threshold of the critical mass. At the threshold, 
actors are stochastically alternating between prosocial action and privatism, 
i.e. sharing costs. However, as action continues, the group divides into a 
permanent class of prosocial good providers and a permanent class of free 
riders. Everyone is better off than they were in the aversive beginning, so 
the division of labor is an absorbing state. The free nders are better off 
than the providers, but the providers are enough better off than they were 
that they do not want to change. 

Similar patterns arise when Macy (1989, 1991b) applies learning theory to 
the classic prisoner's dilemma. Although rational actors achieve mutual 
cooperation more easily, adaptive learners can achieve mutual cooperation if 
the negative consequences of mutual defection lead them to try cooperation 
at the same time. For adaptive learners, the key is some mechanism that leads 
them to synchronize their actions. 

Macy (1991a) builds on Granovetter's (1978) threshold model, again 
translating it into a learning model with probabilistic thresholds (instead of 
probabilistic behavior). Additionally, he explores the difference between 
parallel independent decisions and sequential decision, and he shows how the 
results vary across a wide range of specifications, comparing them with similar 
rationalist results reported by others. The experimentation that characterizes 
a learning model provides the stochastic possibility of creating a critical mass 
of individuals with zero thresholds. The model predicts initial inaction due to 
low rewards for action, but the accumulation of aversive consequences of 
inaction leads the group to jointly evolve to a critical state where a minor 
event, 1.e. a single random contributor, can start. 
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Simple Threshold Models 


A number of models of group action are based on a simplified problem in 
which each person's propensity to act is a direct function of the number of 
others who are already acting. The first, and most influential, was Granovetter 
(1978). Using a constant group size, so that percentages and numbers are 
equivalent, he assumes that each person has a threshold percentage of others 
who must participate before she participates: those eager to participate have 
thresholds of 0%, while those who will never participate have thresholds of 
100%. 

Granovetter’s most surprising claim is that minor perturbations in the 
standard deviation of a distribution produce massive discontinuous changes 
in the number of people acting, from about 6% to nearly 100% of the group. 
He claims that this result can be explained mathematically but not substan- 
tively, because it implies that this kind of collective action is virtually 
unpredictable, since tiny shifts in the heterogeneity of a group must happen 
all the time. However, my spreadsheet analysis of Granovetter’s example 
indicates that substantive interpretation is straightforward and plausible: it is 
an example of a critical mass model. 

The mathematical cause of his apparently shocking result is that Granovetter 
uses truncated normal distributions. Although negative numbers are meaning- 
less as thresholds, normal distributions take on negative values as the standard 
deviation increases, a problem Granovetter deals with simply by recoding 
them to zero. Thus, as heterogeneity increases, the pool of zero-threshold 
self-activators grows. The big jump occurs when the standard deviation is just 
under half the mean, 1.e. when the proportion of self-activators is approaching 
5%. Below the jump, only the self-activators act. At the jump, they suddenly 
are a large enough group to exceed the threshold for people in the low end 
of the bell, and action “takes off.” Thus the discontinuous effect is perfectly 
meaningful. If we keep our eye on the self-activators in a group, we ought 
to be able to observe the critical moment when the threshold occurs.° 

Granovetter’s analyses in the same article with uniform distributions, social 
structure, and spatial ordering point to the significance of zero thresholds and 
to the risks of "gaps" in a sparse distribution that might keep the dominos 
from falling. The domino image of cascading thresholds of action seems 
largely incorrect empirically, but one would generally expect clusters of 
people with similar thresholds and gaps between them. 


^A less significant problem in the same figure is that the decline in the upper equilibnum 
happens much more slowly than the figure implies, the sketch has clearly been drawn freehand 
without reference to the scale on the horizontal axis. Anyone who attempted to interpret this part 
of the graph would have been seriously misled, but it appears no one tried 
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A more recent model with a different orientation that also predicts behavior 
from the sheer number of participants is found in Naylor (1990). Built on 
Akerlof’s (1980) social custom model and Schelling’s (1978) discrete choice 
model, this model assumes that individuals gain some reputation-based benefit 
(i.e. a normative sanction) from participation which is proportional to the 
number of participants. (A collective good is assumed to exist but be irrelevant 
because individuals cannot make noticeable contributions.) Graphical solu- 
tions show the conditions for a stable equilibrium of participation. Naylor 
shows that if groups are homogeneous, the only stable equilibria are 
unanimous and zero participation, and that Schelling's (1978) dichotomous 
conclusions flow from the assumption of homogeneity. Consistent with Oliver 
& Marwell (1988), Naylor finds that heterogeneity reduces the size of the 
critical mass. Naylor & Cripps (1988) are cited as showing that the model is 
robust under changes in the distribution of normative incentives. If both 
joining and not joining evoke normative incentives (presumably from different 
sources), the rather obvious result is that any given individual will join only 
if more than half already have joined. Another modification has two levels 
of norms, one for those who believe in the custom, and another for those with 
a lower level of belief, which produces multiple equilibria. 

Glance & Huberman (1993) use statistical theory to develop an elaborate 
mathematical representation of a n-person prisoner’s dilemma in which 
payoffs depend only on the sheer number of contributors. Individual payoff 
is defined as total payoff divided by group size, forcing the negative group 
size effect claimed later as a result. Actions and information are prob- 
abilistic, and actors vary in the degree of time discounting of short-term 
defection payoffs versus long-term payoffs from building cooperation. These 
factors determine the fraction of other cooperators necessary to motivate 
cooperation, and the probability that this fraction is large enough. Dynamic 
analysis identifies the cutoffs defining four levels of increasing group size: 
(i) the one equilibrium for cooperation is optimum; (11) cooperation is 
optimum but defection is a local equilibrium; (11) defection is optimum but 
cooperation is a local equilibrium; (iv) the one equilibrium for defection is 
optimal. 

In this model, groups beginning at a suboptimal equilibrium (either 
cooperation or defection) tend to remain at it until a probabilistic congruence 
of behavior pulls them out of the local equilibrium and they move suddenly 
toward the optimum, a result is similar to Macy’s. Groups whose two 
equilibria are of equal optimality tend to flip-flop between them, their 50% 
average rates of cooperation disguising the fact that they are not dividing 
labor, but unanimously varying between the two. (This is different from 
Macy’s finding of role differentiation.) 
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Cybernetic Control 


McPhail (1991, 1993; also McPhail & Wohlstein 1986, McPhail & Tucker 
1990, and McPhail et al 1992) has developed cybernetic control theory 
(Powers 1973) models of the ways collective action is coordinated by 
individuals adjusting their behavior to bring their perceptual signals in line 
with a reference signal. This work is grounded in his impressive compilation 
of empirical evidence to support the claim that there are no behavioral 
differences in how people do collective action that distinguish the traditional 
categories of "collective behavior," "collective action," or “social action" 
(McPhail 1991) as well as his (1989) reading of Mead that a common focus 
of attention cannot be treated as unimportant or unproblematic. 

McPhail & Wohlstein (1986) differentiate forms of collective locomotion 
according to the ways in which they are coordinated, and they show 
experimentally that different instructions about how to coordinate action 
produce different degrees of coordination. For example, subjects did march 
to a common cadence when instructed to do so, but not when they were 
merely told to get from one point to another. More recent work uses 
simulations. McPhail et al (1992) develop a computer simulation for 
individual and collective action in temporary gatherings, describe how it 
works, and generate a wide variety of illustrative cases, each represented 
by a graphical “picture” of the group at various points in time. From 1 to 
255 individuals can be in the gathering, each operating with one, two, or 
three control systems (seeking a destination, avoiding collisions, and seeking 
the path of other individuals). McPhail (1993) uses this simulation to show 
how clusters, arcs, and rings form in crowds as a consequence of common 
orientations. 


955 


Consensus Formation 


Feinberg & Johnson (Johnson & Feinberg 1977, 1990, Feinberg & Johnson 
1988, 1990a,b) develop models for the emergence of consensus in crowds. 
The core of their model is the well-established empirical phenomenon of 
"milling" in a crowd, wherein people move around and talk with others near 
them. Johnson & Feinberg use the computer to create large groups of people 
with various distributions of initial opinions and susceptibility, who are 
divided into spatial subgroups. People are influenced by the opinions of actors 
near them; some models also include a central agent who is trying to control 
or influence the crowd. An important additional element (also consistent with 
the empirical literature) is that those who disagree with the emerging consensus 
tend to back away from the scene, permitting consensus to form among those 
who remain. Crowd sequences either end with dispersal, if enough exit that 
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too few remain to act, or end in some form of moderate or extreme action, 
if a high enough number of those remaining converge on the same choice. 

In their various analyses, Johnson & Feinberg explore the effects of the 
size and density of crowds, the mix of initial opinions and suggestibility in 
the crowd, and variations 1n the strategy of an agent who 1s trying to mobilize 
an extreme response from the crowd. Because they create “groups” which 
stay in the computer’s memory during execution, their models are large and 
time-consuming to execute. Most of their published results have been based 
on limited experimental designs. Johnson & Feinberg have only begun to 
explore the full implications of their models in terms of systematically varying 
factors that influence outcomes. Their work is entirely numerical to date and 
would be improved by formalization or sensitivity analysis. However, they 
have conducted experiments that vary the core parameters of the model. 

Johnson & Feinberg’s insight, that consensus is a product of both influence 
and exit, seems important for understanding the processes of action within a 
wide range of collectivities, especially organizations and also some informal 
networks or coalitions of organizations. Variations on their models are likely 
to prove useful for other substantive problems. For example, the construction 
and diffusion of a social movement ideology might work according to similar 
principles. 


SUBSTANTIVE CONCLUSIONS 


Recent models of collective action are diverse, representing attempts to 
capture different kinds of problems. One-actor models continue to be useful 
for simple empirical predictions of individual behavior, but they are providing 
few new theoretical breakthroughs. Models of collective decisions have great 
potential but do not seem to be the subject of much present work. Models of 
the strategic interaction between movements and their opponents are just 
beginning. These require much more development but will likely be enriched 
by empirical time series data. Recent empirical work suggests that it 1s 
especially important to model the ways states respond differently to different 
segments of the same movement, and to how these responses change over 
time. 

Most recent development in collective action models has centered on the 
problem of the interdependence of individuals within collectivities. Even here, 
there is not yet enough information to permit confident conclusions. Emerging 
results reveal complex interactions that prevent simple generalizations. 
However, there are some patterns. First, models with quite different assump- 
tions about individual decision processes and quite different assumptions about 
interdependence and coordination mechanisms all tend to yield predictions of 
thresholds and discontinuities, and predictions of internal divisions of labor 
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within collectivities. Something hke "critical mass" phenomena seem endemic 
to collective action. Second, the degree of group heterogeneity always changes 
results, but its effects vary: Sometimes it promotes action and sometimes it 
inhibits it. Clearly more research is required to understand the collective 
behavior of heterogeneous groups. 

Finally, plausible formal representations of interdependence and mecha- 
nisms of coordination have permitted scholars to reengage some of the classic 
problems of the coordination of group action. Having transcended the 
limitations of the "free rider" question, scholars are able to acknowledge 
Olson's signal contribution in problematizing mobilization and collective 
action and then move on to other problems. The "rational actor" still has a 
place in these models but now shares the stage with the adaptive learner, the 
target of influence, the probabilistic decision-maker, and the inscrutable 
person who 1s willing to make a particular contribution for reasons known 
only to himself. All these individual actors are brought together in a variety 
of ways to coordinate their actions. Different theorists have been working 
with different coordination mechanisms, but we do not yet have the capacity 
to step away from these distinct analyses and identify types of or patterns 
among coordination mechanisms, nor have we identified the interactions 
among coordination mechanisms and other important features of collective 
action, particularly production functions. As work proceeds, we should hope 
to create theoretical typologies of collective action situations, so that we might 
identify the types of situations in which, for example, particular coordination 
mechanisms would be most effective. There 1s much to do. 


APPENDIX: PRINCIPLES OF FORMAL ANALYSIS 


This review has assumed the value of formal theory without trying to defend 
it to the skeptical, and space does not permit such a defense. However, the 
review has revealed a number of issues relevant to improving the quality of 
formal theorizing. Most important is to move away from illustrative examples 
and elaborate case studies to experimental designs and response surfaces. In 
the process, theorists need to attend to issues of mathematical consistency and 
metrics or normalizations. 

The level of mathematical sophistication in the models reviewed is uneven. 
Some express equations symbolically and conduct standard mathematical 
exercises of solving for equilibria or creating reduced form equations. Others 
work only numerically, arbitrarily choosing numerical coefficients for equa- 
tions and ranges for variables. Some models are developed within standard 
mathematical approaches such as linear equations, difference equations, or 
statistical theory, while others arise from ad hoc attempts to model problems 
not easily represented in standard ways. There is a general need to upgrade 
purely numerical models to a more general symbolic form, and to attempt to 
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relate ad hoc models to mathematical approaches with known solutions or 
standard methods of approach.° 

Metrics and functional specifications are also a problem. Many of the 
reviewed works devote no attention to determining plausible metrics for the 
relevant variables, nor to normalizing variables to some metric-free standard. 
Without either meaningful metrics or normalized variables, results can be 
profoundly distorted and misleading. In most models, it should be possible 
to express some parameters as functions of other parameters, so the metric 
of one can define the metric of others, thus reducing the dependence of results 
on metric. Numerical results are meaningless without specific attention to 
functional forms, parameterizations, and the metrics of variables. 

The mathematical form of the functional relation between variables is often 
chosen arbitrarily and atheoretically, most often as some constant linear 
coefficient which has no theoretical justification, often even in the face of 
theoretical arguments that imply nonlinear relations. Even when theory guides 
parameterization, the review reveals insufficient attention to the sensitivity of 
models to choices in parameterizations. It is ultimately the parameters 
themselves that are of interest. We know that not all regimes have a 
responsiveness of 0.4 to mobilization (even if we knew the metrics of 
responsiveness and mobilization). What we want to know is how the degree 
or form of responsiveness of a regime to mobilization affects the trajectory 
of a system. Formal analysis of a model’s equations can often reveal the 
Critical interrelations among parameters that determine outcomes. If such 
results are unavailable, numerical analyses should be used to address the issue. 


Uses of the Computer in Theorizing 


There are conceptual differences among uses of the computer for formal 
models. The classic computer simulation is an elaborate case study 1n which 
all 1nitial parameters and relationships are pre-determined, and the model is 
simply turned on to see how it changes over time. Such simulations are often 
useful in engineering and some biological or physical science applications, 
where there are clear bounds on the problem and a small number of 
well-measured fixed inputs and well-understood physical relationships that 
undergird the model, but they rarely have more than heuristic value in studies 
of collective action. In the physical sciences, numerical computations are also 
used to check the accuracy of analytic solutions to complex sets of equations. 
There are no cases of this in the social sciences, presumably because those 
analytic solutions we have are relatively simple. 


°See Huckfeldt et al (1982) for pedagogic development of sumple difference equation models 
of mobilization, diffusion, and dyadic interactions which show how to derive solutions for such 
models. 
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There are two important uses of computers by collective action theorists. 
The first is to investigate the behavior of a determinate mathematical model 
under varying initial conditions. In this case, the inputs of interest are varied 
systematically to determine the multidimensional response surface which 
shows how the dependent variable changes as a function of the independent 
variables. The second is Monte Carlo simulation of models with probabilistic 
or stochastic elements. In these cases, random number generators are used, 
each case contains a random error component, and the same inputs may yield 
different results depending on the random elements, so enough cases have to 
be generated to wash out random noise. In collective action models, the 
principal uses of Monte Carlo techniques are for investigating the effects of 
group heterogeneity and for stochastic models of individual choice. It is 
increasingly common to see these two models combined, using Monte Carlo 
techniques as part of a larger experimental design to construct a complex 
response surface. 


From Case Studies to Experiments and Response Surfaces 


Formal analysts of collective action need to escape the mentality of the case 
study simulation or numerical example and think instead in terms of 
experimental design and response surfaces. À response surface is a multidi- 
mensional graph that shows how a dependent variable changes as a function 
of many independent variables. The statistical concept of a response surface 
assumes a single quantitative dependent variable, but the response surface is 
an appropriate metaphor for qualitative outcomes as well. Investigations of 
complex models should be grounded 1n an experimental design with ranges 
and values of independent variables chosen to give adequate coverage of the 
response surface of interest. Ideally, every simulation's parameters, variable 
ranges, and functional forms should be varied and tested across all possible 
combinations. Obviously, this implies enormously large sample sizes and is 
impossible at the limit, but too many projects have involved no sensitivity 
analysis at all. 

Experimental design is an appropriate way to conceptualize formal theo- 
rizing. Some things have to be held constant, and it is the theorist's job to 
identify these constant factors clearly as the assumptions and scope conditions 
of the theory. Within the theory, there are variables and functional relations 
among variables. If theory specifies the nature of the relation between 
variables, it can be treated as fixed, but if theory merely specifies some general 
dependence, the functional form itself may be considered a variable. Because 
it 1s impossible to vary everything at once 1n an experiment, some of those 
factors and parameters which the theory treats as variable will have to be held 
constant as others are varied. But if theorists view themselves as taking 
cross-sections from the larger conceptual response surface, they are likely to 
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choose their variables and discuss them in more sensitive and contextualized 
way. 

Attention to experimental design would also improve comparisons between 
different theoretical paradigms. Such comparisons are often impossible 
because models typically differ in a variety of ways, and it is difficult to 
isolate the effect of any one variable (e.g. rational choice versus adaptive 
learning assumptions). The case study mentality leads each author to choose 
a list of assumptions that seem most plausible. An experimental design 
mentality would lead authors to present more tightly controlled comparisons 
between models where as much as possible is held constant so that the effect 
of one factor can be identified. 

Although older numerical analyses were constrained to limited designs by 
real dollar charges per CPU minute, today the greatest impediment to working 
with experimental designs instead of case studies is figuring out how to record 
and analyze the tens of thousands of data points it is now possible to produce. 
There is a tremendous need for technical advances in the design and analysis 
of efficient complex many-variable experiments, in the graphical display of 
complex interactions, and in data reduction techniques for conveying patterns 
of results first to the researcher, and then to readers. We also need statistical 
theory on the extent to which standard data-analytic techniques (e.g. regres- 
sion, loglinear models, event history analysis) are appropriate tools for 
understanding the deductive relationships implied by simulations. 
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Abstract 


This paper departs from reviews of urban sociology that juxtapose competing 
Marxist and ecological theories, arguing instead that over the last 25 years a 
venerable social organization paradigm has been superseded by a fertile 
political economy paradigm that continues to orchestrate current research. 
The contributions of political economy are reviewed under six headings: 
historical explanation, comparative studies, socioeconomic processes, spatial 
relations, ethnicity and community, and political movements. The conclusion 
argues that urban sociology is currently in need of new directions. Some 
criticisms and alternatives are suggested. 


INTRODUCTION 


Sometime between 1968 and 1973, a revolution occurred in the field of urban 
sociology, a paradigm shift that became the predominant, if not the exclusive, 
orientation in the 1980s and continues to guide current work, even as some 
of its former enthusiasts Jook to new sources of inspiration. From its inception 
with the Chicago School in the early years of this century, urban sociology 
had embraced the social organization paradigm founded on the concepts of 
community organization and disorganization, ecological succession, and 
market-regulated social differentiation. In the late 1960s, a crisis developed 
in urban society and in the academy, particularly in the United States, 
England, and France where a new urban sociology soon blossomed. The social 
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crisis was composed, in different combinations cross-nationally, of racial 
segregation and conflict, poverty, urban riots, political mobilization of 
communities around employment and education, and efforts to replace urban 
renewal with affordable housing. The urban crisis energized social scientists 
who quickly discovered that the venerable paradigm of social organization 
failed to engage contemporary problems. The old theory was designed for an 
era of migrant adaptation, urban growth and differentiation, social mobility, 
and community. The crisis called for an explanation of growing inequality 
and social unrest. 

Ill-defined at first, a new paradigm began to emerge under the name political 
economy. In 1968, Manuel Castells published an article in Paris that asked 
“Ts there an urban sociology?" Drawing on the work of Henri Lefebvre and 
Alain Touraine, Castells claimed that urban sociology lacked the combination 
of theory and substantive focus necessary for an explanatory science (Castells 
1975). Castells's early writing in French reached a limited audience until the 
author took a position at the University of Wisconsin and Chris Pickvance 
(1975) published a translation and exposition of various French writers in the 
United Kingdom. In 1973, however, David Harvey, a British geographer 
working at Johns Hopkins University, published Social Justice and The City 
to an enthusiastic reception that clearly signaled that a paradigm shift was 
under way. Harvey's book, like the field itself, began with a set of “liberal 
formulations" such as central place theory. It found them inadequate to a 
structural explanation of urbanization and began to reconstruct the appropri- 
ately "radical formulations" that explained the coherence of urban form and 
process in the workings of capital accumulation. À new set of tools was being 
fashioned that promised to reveal the mechanisms of inequality and the 
interests underlying urban policy. For a new generation of urban sociologists, 
there was no going back. 

If European theorists hewed to a neo-Marxist interpretation of urbanization, 
North American converts inclined to a critical political economy that gave 
greater emphasis to the structure of political power than to the mode of 
production. Urban sociologists in the United States at the time were influenced 
less by Marx than by C. Wright Mills and Floyd Hunter. Inequality and 
political conflict were the metier of the new urban sociologists of the time 
(e.g. Piven & Cloward 1971, Miller & Roby 1970, Alford & Friedland 1975). 
Yet the emerging paradigm was interdisciplinary and intercontinental, a joint 
undertaking of neo-Marxists and left liberals, a growth industry in social 
science, and something of an intellectual community. Political economy was 
the phrase that best covered its diverse supporters. 

In outline, the events described and the general timing of the paradigm 
shift in urban sociology are substantiated by various observers. Sharon Zukin 
(1980) may have been the first to use the phrase "the new urban sociology" 


URBAN SOCIOLOGY 303 


in a relatively partisan discussion of developments in the 1970s. She 
characterizes the work as an approach that: 


Situates the new urban sociology within an equally emergent political economy, 
which requires urban sociology to be a more interdisciplinary enterprise (with 
economics and political science) than it has been. Political economy has directed 
them [political economists] toward inquiries about costs, prices, rents, taxes, 
wages; in short, toward the valuation process of capital itself. Finally, they have 
been critically re-evaluating the history of urbanization. Rather than merely 
document the successive emergence of urban forms (e.g. the change from the 
preindustrial to the industrial city, or tbe reproduction of metropolitan urban forms 
in colonial and post-colonial capitals), their historical analysis focuses on the 
hegemony of urban forms within social formations and the hegemony of 
metropolitan culture within the world system as a whole .. By tying together 
urbanization, the quest for profit and domination, and the state's attempts to 
moderate domestic conflict between social classes, the new urban sociology 
achieves a coherence the field lacked. (Zukin 1980:579) 


Written without knowledge of Zukin's forthcoming piece, a contempora- 
neous paper of my own entitled “The new urban sociology" discussed the 
now familiar origins of the approach and described it as follows, 


First, from a theoretical and historical standpoint it holds that urbanism itself 
requires definition and explanation rather than being taken for granted or treated 
simply as a phenomenon of aggregation. Urbanism and urbanization must assume 
the status of 'theoretical objects' in the sense of phenomena that arise (or do not) 
and take different forms under various modes of socioeconomic organization and 
political control. Second, the approach is concerned with the interplay of relations 
of production, consumption, exchange and the structure of power manifest in the 
state. None of these can be understood separately or as analytically prior except 
in the sense of a logical exercise. Third, as in the case of urbanism generally, 
concrete urban processes (e.g. ecological patterns, community organization, 
economic activities, class and ethnic politics, local government) must be 
understood in terms of their structural bases or how they are conditioned by their 
connection with economic exigencies, political arrangements, and the socio-cul- 
tural milieu. Fourth, the approach is fundamentally concerned with social change 
and conceives of this as growing out of conflicts (or contradictions) among classes 
and status groups. These conflicts are the basis of the political process, which 
increasingly is coincident with the arena of the state. Changes in the economy 
are socially and politically generated as well as mediated. Political and social 
changes are in no sense independent of the economy. Finally, the perspective is 
inextricably tied to the concerns of normative theory. It is concerned not only to 
draw out the ideological and distributive implications of alternative positions, but 
is critically aware of its own premises and the dilemmas they too pose (Walton 
1981:376). 


Although these descriptions were in general agreement and each was based 
on close study of the change, it was not long before other versions began to 
appear that rewrote the history. By the mid-1980s, urban sociology was being 
characterized as a field divided by "two paradigms, ecology and Marxism" 
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(Gans 1984:278; see also Logan & Molotch 1987:4—12). In fact, the 
juxtaposition of ecology and Marxism served the later authors as a device for 
typifying approaches with which they disagreed and as a springboard for 
introducing their own formulations. Worthy as those are, as we shall see 
below, the contrast did not accurately reflect developments in the field. 
Although human ecology has made valuable contributions to urban sociology, 
from Ernest Burgess and Amos Hawley to John Kasarda, there has never been 
an ecological paradigm that unified the field in a single set of assumptions 
and animated a professional community, in Kuhn's (1962) sense of the term. 
In this sense, a social organization paradigm existed, within which urban 
ecologists labored amicably alongside students of community, anomie, 
"pathology" (e.g. crime and delinquency), differentiation and succession, 
social area analysis, political pluralism, and so forth. This was the paradigm 
formulated by the Chicago School, advanced in a new generation of 
ethnographic work by Gans (1962) and Suttles (1968), and theoretically 
rendered in its last version by Greer (1962). And for all its genuine 
achievements, this was the paradigm that the new urban sociologists judged 
inadequate to the social and intellectual tasks of the late 1960s. The social 
organization paradigm, moreover, was succeeded not by neo-Marxism, but 
by a more nuanced and catholic political economy. 

The purposes of this chapter are to set straight this record of theoretical 
development, describe the major accomplishments of political economy, and 
consider recent criticism about the desirable directions of future work. In 
brief, my argument is that political economy has been the dominant paradigm 
for the past 25 years; like its predecessor ıt has produced valuable results; and 
by the mid-1990s the field is once again in need of a new direction. 


THE SUBJECT OF URBAN SOCIOLOGY 


Twenty-five years have passed since Manuel Castells (1975 [1968]) posed the 
question “is there an urban sociology?" and answered it in the negative. 
Castells's influential criticism gave pause to many urban sociologists, but 
discouraged few. On the contrary, the challenge to a peculiar yet dominant 
North American tradition issued by Castells and a number of European 
urbanists produced a generally sympathetic response concerning the need for 
a new urban sociology. The great force of the criticism lay less in its 
formalistic objection to a disassociation of "real and theoretical objects" than 
with its lucid and undeniable demonstration of the explanatory weakness of 
urban theory. In brief, Castells argued that urban sociology had no "real" or 
concrete object because "urban" referred only vaguely to things not rural 
rather than to some specific phenomenon. Rather, urban sociology (or 
"ideology") was concerned with certain activities such as how social groups 
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formed communities, created a subculture, and avoided anomie, none of them 
intrinsically urban. On the basis of such material sociologists produced a 
general sociology of integration, admirable in its own right but a peculiar 
approach to urban issues, as would become apparent later on when inequality 
and unrest rather than integration came to the forefront of city problems 
needing explanation. The problem was a divorce between the real and the 
theorized objects that defined cities. For his own part, Castells argued that 
there was a unity to urban phenomena that lay in the connection between 
spatial relations and the process of collective consumption [i.e. the consump- 
tion of nondivisible goods such as air quality or public transportation, which 
Theodore Lowi (1969) and Mancur Olson (1965) had previously called “public 
goods"]. Not surprisingly, Castells claimed that Marxist theory linked these 
concrete processes in a general explanation. The sociology of collective 
consumption is a scientific undertaking. Urban sociology is not. 

There was nothing new in the general effort to specify what is distinctive 
about cities in history that was rekindled in the late 1960s—Max Weber, 
Lewis Mumford, Jane Jacobs, and many others had previously addressed the 
question. Peter Saunders (1981:250) observes that "The history of urban 
sociology has been the history of a search for a sociological phenomenon the 
source of which may be located in the physical entity of the city. It has been 
the history of an institutionalized subdiscipline in search of a subject." 
Castells’s mock “astonishment” at the lack of a uniquely urban theoretical 
object, and his consequent characterization of the field as "ideological," was 
no more than a forceful rhetorical device for arguing, as others have before 
and since, that urban sociology would gain more coherence by focusing on 
a particular set of problems that lend themselves to explanation from a 
preferred theoretical orientation. For Castells, of course, the key unifying 
problem was "collective consumption." 

If we take Castells's contribution as no more or less than a valuable example 
of how urban sociology has developed, two important observations follow. 
First, such reasoning has typically made a strong case for looking at things 
in a new way, a case based on the fertility and problem-solving capacity of 
the proffered perspective. Invariably, however, it has also produced demurs 
in the form of alternative solutions to the problem that carry their own research 
and interpretive implications. David Harvey (1973, 19852), for example, 
joined in the conclusion that urban sociology was ripe for a Marxian theoretical 
revolution, but argued distinctively, and with at least equal force, that 
urbanization was best understood as the uniquely modern mechanism of 
capitalist accumulation. Reflecting on these developments, Peter Saunders 
(1981:256) concludes that “What emerges out of the consideration of the 
Marxist literature, therefore, is an appreciation that 'the' urban question is in 
fact two questions .. from Castells a theoretical concern with collective 
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consumption [and] from Lefebvre, Harvey, and others a theoretical concern 
with the function of space in the process of capital accumulation. These two 
questions are both valid..." 

The current predominance of political economy is demonstrated in the 
modern urban sociology texts (e.g. Light 1983) and in the most celebrated 
research synthesis to be published in the United States in several decades. 
Urban Fortunes by John Logan and Harvey Molotch (1987:17) is based on 
the political economy premise that “the market in land and buildings orders 
urban phenomena and determines what city life can be.” The key difference 
from Marxist and ecological conceptions is that Logan & Molotch understand 
markets as social constructions generated in the conflict between residents 
interested in the use value of property and entrepreneurs striving to realize 
exchange value in the “urban growth machine.” Responding to earlier critical 
discussions of the ill-defined subject, Logan & Molotch argue that by showing 
“how social factors shape prices of places and humans’ responses to those 
prices, we can understand the physical and social shape of cities. We will 
continue to study land markets retaining thereby the real objects of ecologists,” 
but with a theory of socially constructed interest conflict. 

With this framework, Logan & Molotch (1987:49) “strive to develop an 
authentic urban sociology” and in many ways they succeed. Their book is at 
once a refreshing interpretation of previous research and an extension of 
political economy. Yet the subject matter problem is solved no more 
definitively here than it was in Castells. A “political economy of place" based 
on socially constructed struggles in the market over alternative use of space 
is as much a societal, or rural, phenomenon as it is urban (as the authors 
tacitly concede in their analysis of the environmental movement that would 
qualify the struggle to save the Grand Canyon from air pollution as an urban 
issue). All of this suggests that the pursuit of a uniquely “urban object” may 
be misleading. 

Thus, the second observation: The proliferation of attempts to reconceptu- 
alize urban sociology suggests that there is such a field—an energetically 
contentious one at that. Although urban sociologists differ, the various 
exponents of modern approaches coalesce around a sociology of space—the 
processes in which socioeconomic change and inequality operate in and 
through space. No single formulation of the urban question has yet won 
universal and permanent allegiance, nor is any one likely to. The urban 
question is not an epistemological riddle that promises some timeless solution. 
Rhetorical devices must not lead to a confusion of theory (the explanations 
we invent for empirical regularities) and metaphysics. The urban question is 
rather a historical problem that lends itself to a variety of particular solutions 
that take the general form: “if we look at things in this fashion (e.g. cities 
promote capitalism), here are the problems that can be solved (e.g. alterations 
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in spatial form that correspond to changes in the organization of production)". 
Understood in that way, the proliferation of efforts to find a subject has made 
urban sociology more coherent than most fields. In response to the question 
"Is there an urban sociology?" a good many more complementary answers 
can be produced than would be the case if the question were put to, say, an 
economic, political, or medical sociology. 

Since the late 1960s, the urban question has been formulated in various 
ways that have a political economy orientation in common. What have we 
learned from the effort? 


THE CONTRIBUTIONS OF POLITICAL ECONOMY 


In the limited space available, I simply highlight the contributions of political 
economy with a set of themes and representative research. In most cases, the 
research comes from authors who would identify themselves as proponents 
of the new urban sociology or who approach the subject in ways that conform 
to the earlier characterizations. In the illustrations that follow, therefore, I 
have not imposed any framework on the authors cited or used their work to 
prové any points beyond those they intended. My claim is that political 
economy has been the most unifying and vigorous paradigm in urban sociology 
during the last two decades. That does not mean that all urban sociologists 
subscribe to the approach or that the only valuable research produced in the 
period fits within the paradigm. The following summary, therefore, chronicles 
the achievements of the prevailing style in urban research. 


Historical Explanation 


As the previous characterizations of political economy agree, urbanization 
and urbanism must be explained as contingent historical processes. Kingsley 
Davis (1955:429) has remarked that "compared to most other aspects of 
society—e.g., language, religion, stratification, or the family— cities ap- 
peared only yesterday, and urbanization, meaning that a sizable proportion 
of the population lives in cities, has developed only in the last few moments 
of man's existence." 

Urbanization is the demographic expression of agricultural commercializa- 
tion, industrialization, market centralization, and state formation. Although 
these processes have unfolded most rapidly under western capitalism, urban- 
ization on a grand scale has occurred in China, the former Soviet Union, and 
various developed and developing countries with mixed economies. If some 
advocates of political economy have erroneously equated capitalism and urban 
inequalities, others have demonstrated similar problems in socialist urbaniza- 
tion (e.g. Szelenyi 1983). 

David Harvey (1985a 1985b), one of the most important and certainly one 
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of the most prolific writers on the new urban sociology develops a rigorous 
Marxist analysis of how urbanization proceeds under capitalism, how capitalist 
urbanization shapes consciousness, and so also the terms of political conflict. 
Harvey’s first premise is that “urban” is not a proper subject of inquiry by 
contrast to the capitalist forms of urbanization that have a specific meaning 
and are not generalizable across societies. The urban process under capitalism 
arises from the interplay of accumulation and class struggle. Thus, the “built 
environment” is produced to serve production, circulation, and consumption. 
It changes in response to periodic and inherent crises which invigorate class 
struggles nascent in a continuous process of uneven development. What we 
see and reify as the city is a physical network of factories, stores, offices, 
schools, and roads, all hitched to the primary function of accumulating capital 
and all vulnerable to decay understood as devaluation (e.g. abandoned plants 
or railroads) stemming from failing profits and the obligation of capital to 
plow new circuits. Change results from crises which are costly in terms of 
devaluation and the resistance of those left bebind by the next cycle. 

The asperity of this framework, however, gives way to a complex narrative 
when Harvey analyzes the historical renovation of Paris after 1848 that led 
to the revolutionary Commune of 1871. The Second Empire of Louis 
Napoleon Bonaparte pursued a program of economic recovery founded on 
state investment in national and urban infrastructure. Paris annexed its suburbs 
and, with immigration, doubled its population in 20 years. The emperor 
appointed Baron Haussman to rebuild Paris, with a massive program of public 
works and liberal credit devoted to roads, rail networks, fashionable boule- 
vards, department stores, and the infrastructure of gas, water, and sewers to 
support them. "What was perhaps the first great crisis of capitalism was 
overcome, it seemed, through the long-term application of capital to the 
reorganization of space relations" (Harvey 1985b:70). But the reorganization 
threatened other banking and property interests, displaced the city's large 
working class through deindustrialization, and mobilized liberal professional 
groups all of whom supported the Commune's appropriation of the public 
space created by Haussman's reconstruction. After the bloody defeat of the 
Commune, the Basilica of the Sacred Heart was erected, a symbol of popular 
excess and bourgeois triumph. The study is organized around an underlying 
economic crisis and the contradictions of its resolution, yet the historical and 
symbolic elements of the analysis show that Harvey's explanation goes far 
beyond Marxist functionalism. 

In summary, there are two Harveys, or two levels in his work. At the level 
of grand theory, a Marxist analysis of capital accumulation explains everything 
that is important about urbanization, but it does so by equating urbanization 
with some of the many ways in which capital accumulation is facilitated. At 
the level of historical research, particularly the nuanced analysis of Paris under 
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the Second Empire, the contradictions of capitalist development provide a 
background against which urban renewal, political conflict, mobilization in 
space, and revolution all take place in related but unpredictable ways. 
Harvey’s purpose is precisely to set up this tension of theoretical logic and 
empirical contingency as a way of challenging our explanatory schemes. 

Harvey is not the only one to find the foundations of urban theory in 
nineteenth-century France. Charles Tilly (1974, 1975) explains the growing 
concentration of people and power in cities through the interrelated processes 
of capitalist development and state formation. On one hand, capitalist 
development, which is increasingly industrial in the late eighteenth and early 
nineteenth centuries, reduced the labor requirements of commercialized 
agriculture, attracted migrants to urban factories, centralized commerce, and 
concentrated wealth in cities. On the other hand, governments in the formative 
nation states allied with capital and created new urban constituents such as 
armies and civil servants. As resources were redirected to the urban center, 
provincial and agrarian interests suffered. In the area of food policy, for 
example, supply was appropriated for export to the cities, leaving the towns 
and villages with shortages. Prices paid to the producer were reduced in favor 
of the urban consumer. Food riots in the provincial towns were a characteristic 
symptom of centralization throughout western Europe. For Tilly, crucial urban 
processes such as migration, proletarianization, and collective action are 
explained by the interrelation of actors pursuing capitalist development and 
state formation. 

Urban political economy has also focused on historical change in the United 
States. The same theoretical contrast between Harvey and Tilly distinguishes 
the approaches of David Gordon (1978) and John Mollenkopf (1983). Gordon 
argues that urban development in the United States is divided into three periods 
(commercial, industrial, and corporate), each with its own economic priority 
and, accordingly, its distinctive urban form, function, and conflicts. During 
the first 200 years of US history, four commercial cities dominated: New 
York, Philadelphia, Boston, and Baltimore. The commercial city centered on 
the port, the business of the docks and shipping. The principal occupational 
groups included merchants, artisans, and laborers (sailors and longshoremen). 
Social classes and economic activities were closely integrated in space. 
Conflicts concerned customs, trade restrictions, and impressment of labor into 
merchant service which occasionally produced riots. The industrial city 
emerged after the Civil War in midwestern sites such as the prototypical 
Chicago. A few commercial cities such as Philadelphia made the transition, 
but for the most part the industrial cities were the new Pittsburgs and Akrons. 
A rapidly expanding labor force of European immigrants formed working 
class neighborhoods near the factories, themselves convenient to rail trans- 
portation outside the central business districts. The upper and middle classes 
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lived a comfortable distance from noxious factories. Segregation by race and 
class took root under these conditions. Conflicts concerned industrial strikes 
and the labor movement. Gordon argues that an important part of the shift to 
industrial decentralization in the modern corporate city began in the 1920s 
and was a deliberate effort to move industrial production away from organized 
working-class neighborhoods to the suburbs where strikes and union demands 
might be avoided. The corporate city introduced a new form of segregation 
that extended class and racial divisions across whole metropolitan areas while 
adding great differences in wealth and political power to the city-suburb 
divide. Conflicts now focused on differential quality of schools, employment, 
investment (redlining), public expenditures, and amenities. 

Mollenkopf would agree with this evolutionary pattern of American urban 
history, but he would explain the central processes and transitions by reference 
to the political realm. Urban policy has been the centerpiece of national efforts 
by political parties to develop coalitions that will put them in power. After 
the depression, the Democratic Party built an alliance of urban blue collar 
workers, black migrants to northern cities, catholics, and ethnic minorities. 
But Republicans in the 1950s were able to draw suburban support and the 
new defense industries. Johnson’s war on poverty and Nixon’s new federalism 
were subsequent chapters in the struggle over the “contested city.” 


Comparative Studies 


Perhaps the most distinct feature of urban political economy is its comparative 
breadth. The new work has attained a truly international perspective both by 
encouraging cross-national research and by attracting scholars from a variety 
of countries, including specialists on Third World urbanization. The leading 
journal publishing the new urban sociology is the International Journal of 
Urban and Regional Research. 

The previous social organization paradigm generated relatively little com- 
parative work, as an otherwise useful collection by Schwirian (1974) 
demonstrates in its paucity of international coverage and interpretations limited 
mainly to replicating US research designs (e.g. social area analysis) in other 
places. The new urban sociology was received enthusiastically and developed 
further by students of Third World urbanization (e.g. Roberts 1979, Portes 
& Walton 1976, Gilbert & Gugler 1982, Gugler & Flanagan 1978, McGee 
1967, McGee & Armstrong 1985). 

Underdeveloped and rapidly urbanizing countries provided a global labo- 
ratory for observing the effects of rural modernization on urban migration, 
dependent capitalism on the urban labor force, elite power on public policy 
as well as how the combination of these forces produces poorly served cities 
surrounded by squatter settlements. From the pioneering urban anthropology 
of Oscar Lewis (1959) to recent studies of squatters and their social networks 
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(Lomnitz 1977, Vélez-Ibafiez 1983), ethnographic work on Third World cities 
has added a comparative dimension to North American treatments of the urban 
village. But studies of Third World urbanization have moved outward and 
upward from city neighborhoods to questions of how national and regional 
economies affect the urban labor market, how cultural and religious divisions 
shape class-formation in the city, and how collective action such as labor 
movement protest is affected by these converging forces (e.g. Lubeck 1986). 
In a recent study with Charles Ragin, we demonstrated how the Third World 
debt crisis led to cross-nationally implemented economic austerity, or “struc- 
tural adjustment,” programs and how in turn these regressive policies 
generated a global wave of food riots and strikes in cities and among the 
urban poor (Walton & Ragin, 1990). When combined with case studies of 
the urban condition and political conflict (e.g. Seddon 1984), this kind of 
analysis provides a solid link between local insight and global generalization. 

Recently, comparative studies of urban political economy have turned their 
attention to what Friedmann & Wolf (1982) call “global cities,” those major 
international hubs of capital and information that seem more closely linked 
to one another than to their domestic hinterlands. Saskia Sassen’s (1991) book 
The Global City applies this kind of comparison to New York, London, and 
Tokyo. Sassen’s argument is that these cities epitomize a process of 
globalization in which the functions of finance and control of the economy 
become more concentrated in a few great cities while production is 1ncreas- 
ingly decentralized. The two trends are causally related: Greater concentration 
of financial and business services is required by the fragmentation and 
dispersal of manufacturing to nonmetropolitan subsidiaries, offshore plat- 
forms, and low-wage enclaves around the world. One familiar result of this 
process is deindustrialization in developed countries and the decline of old 
manufacturing cities such as Manchester and Detroit. Another is the growing 
polarization of the global cities themselves as the metropolitan labor force is 
divided between the well-paid professionals in law, trade, and finance, and 
the poorly paid service workers who cook, clean, and babysit for the 
professionals or who labor in the new electronic and apparel sweatshops— 
The fundamental dynamic posited here is that the more globalized the 
economy becomes, the higher the agglomeration of central functions in a 
relatively few sites, the global cities, leading to cities that produce mainly 
specialized services, a truncated national urban system, and polarization 
within the global city (Sassen 1991:5). Interestingly, Sassen’s comparison 
shows that the “fundamental dynamic” works as suggested in New York and 
London, but not in Tokyo, suggesting that cultural factors temporize the 
influence of capital on urban social organization. The general problem of 
economic restructuring and its urban implications is analyzed comparatively 
in a collection by John Logan and Todd Swanstrom (1990) which includes a 
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good theoretical introduction and a critique of certain trends in urban 
sociology. 

Finally, the end of the cold war during the 1980s has produced new 
opportunities in comparative research for scholars from eastern Europe and 
the west. This research has two foci, the more established studies of socialist 
urbanization (e.g. Musil 1980) and the newer work on cities after socialism 
(Harloe et al 1992). Szelenyi's (1983) work shows that social inequalities 
under socialism stem from centralized state policies, particularly those in 
support of heavy industry where employment grew more rapidly than it did 
in cities. Many of the contrasts between socialist and capitalist urbanization 
previously theorized from scattered sources (e.g. Giddens 1973) have been 
documented recently. In a useful review of this literature, Kennedy & Smith 
(1989) conclude that the countries of east central Europe are underurbanized 
given their level of industrialization by contrast to the west, that regional 
inequalities are less marked, and that low levels of urban primacy derive from 
the historical pattern in which the east was assigned the role of a food-pro- 
ducing periphery for the west. Doubtless east-west comparisons will be the 
most lively avenue of urban research in the next decade as investigators try 
to determine whether the earlier pattern of socialist urbanization will change 
to resemble western cities with the introduction of market economies. One of 
the great natural experiments in urban sociology is underway. 


Socioeconomic Processes 


One of the most general discoveries of the new urban sociology is the informal 
economy-—a variety of paid and cost-saving economic activities that operate 
outside the ambit of state regulation through households, social networks, and 
clandestine businesses. The informal "sector" (the term many prefer to 
"economy" which suggests dualism) was first noted by Keith Hart (1973) 
whose work in Accra, Ghana showed that many more people were employed 
or economically active than those counted in formal or census-enumerated 
jobs. The discrepancy was explained by the large number who made a living 
in various forms of officially uncounted self-employment, from street vending 
to prostitution and crime. Following Hart, a number of researchers demon- 
strated vigorous informal sectors in Latin American and Asian cities. And 
these authors took the next interpretive step by showing the crucial role of 
casual labor in subsidizing formal sector employers, including multinational 
corporations, and middle-class consumers. Although the formal sector benefits 
from the informal economy's cheap labor and available services, it external- 
izes the costs—that is the formal sector bears none of the costs of social 
reproduction. (Portes & Walton 1981, Bromley & Gerry, 1979, McGee & 
Armstrong 1985). The informal sector was defined by this feature; it was 
unregulated, meaning that workers enjoyed no protections, job-related bene- 
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fits, social security, or continuous employment. The absence of state or 
employer protections makes informal sector labor and services cheaper and 
more flexible, explaining why they have been on the increase with the 
economic development of Third World Cities. 

Finally, researchers have turned to the growing role of the informal sector 
or the "second economy" in the developed countries. Ray Pahl's (1984) 
Divisions of Labour demonstrated in a British sample the frequent use of 
informal services (but not of informal employment) and the irony that the 
more affluent were the greater beneficiaries of cost-saving exchanges of 
services. Two recent collections (Portes et al 1989, Redclift & Mingione 1985) 
describe the growing variety of informal work internationally, supporting the 
general proposition that casual, flexible, and low-wage work is replacing the 
formal sector, unionized, well-paid work of the industrial city. That, of course, 
dovetails with Sassen's study and suggests that the future of employment is 
a vital policy question for continuing work in urban political economy. 

My point here is that the new urban sociology has produced important 
discoveries by asking explanatory questions such as Hart's query about the 
discrepancy between official and actual employment figures and what all those 
"unemployed" were doing to survive. The question uncovered a complex 
process which we now call the informal sector. Similar processes explain such 
key problems as how industry has adapted to rapid technological change and 
decentralization and how households contribute to the reproduction of society. 
The processes are flexible specialization and the gender division of labor— 
large subjects that must be described here in a small space. 

Studies of flexible specialization in manufacturing (e.g. Piore & Sable 
1984) have been extended to urban studies with the question of what is in 
store for the "post-Fordist" city where manufacturing jobs and working class 
institutions have vanished. The literature on flexible specialization claims that 
manufacturing has reduced the size and dispersed the locations of firms 
capable of rapid response to changing consumer preferences and technological 
change. The computer industry and its origins in Santa Clara County, 
California, garages is the prototypical image. Allen Scott (1988) has studied 
the regional distribution of these industries (e.g. film-making, electronics, 
apparel) in Southern California, concluding that the production process, and 
specifically flexible specialization, generates an urban structure that is 
decentralized, mobile, and polarized. Critics suggest, however, that the pre- 
and post-Fordist divide oversimplifies matters, that industry has always been 
flexible, that certain inflexibilities are advantageous, and, generally that much 
more than the production process is at work shaping the modern or 
post-modern city (e.g. Sayer 1989). 

Finally, research on gender and the household economy demonstrates how 
families divide productive work, paid and unpaid, and how they persevere in 
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hard times, training and supporting family members in their roles as citizens 
and workers. Perhaps the most extensive study of the household in a 
developing society is the book by Henry Selby and associates (1990) based 
on participant observation and a survey of some 9000 Mexican households. 
Their principal discovery was the great resilience of extended families in 
crisis; their ability to double up home occupancy, increase the number of 
income earners, reduce expenditures, and survive physically if not culturally 
(that is, as undamaged persons). Related research in developing and developed 
countries demonstrates the great amount of society’s necessary and productive 
labor that is performed by women in the household economy whose work is 
not only unpaid but unrecognized as essential to the process of social 
reproduction. 


Spatial Relations 


A singular accomplishment of urban political economy has been the integra- 
tion of social and spatial processes. From the landmark research on human 
ecology (Hawley 1950) to modern theoretical formulations such as “distantia- 
tion” (Giddens 1981), sociologists have tried to incorporate the constraining 
fact of space into their analyses of society. Although the ecologists made 
important studies of land use and urban spatial “zones,” an effective 
integration with social action was not accomplished. 

It is only recently that a marriage of geography and urban political economy 
has made a breakthrough possible. Roger Friedland (1992:11) observes about 
some of this new writing: "in the last decade, we have begun once again to 
focus on space as an integral component of social structure and action. The 
social organization of space and the spatial organization of society have 
become two sides of a single preoccupation." 

Logan & Molotch (1987), as we have seen, represent this new approach 
in their effort to develop a "political economy of place." Space enters the 
analysis with the recognition that place has value that affects social action. 
Moreover, place may have different and conflicting (use and exchange) values 
that give rise to socially constructed markets that organize urban (and, one 
must add, rural) space. Geographer David Harvey is perhaps the most constant 
and insightful synthesizer, as he shows in a lucid essay on “Money, time, 
space, and the city" (Harvey 1985b:1): "I argue that the very existence of 
money as a mediator of commodity exchange radically transforms and fixes 
the meanings of space and time in social life and defines limits and imposes 
necessities upon the shape and form of urbanization." Sociologists have yet 
to develop the full implications of the socio-spatial vision found in writers 
such as Harvey, yet urbanists bave been dealing with issues of space all along. 
As Harvey (1985b:19) notes, "Gans thus brought to life in The Urban Villagers 
what had been evident to close observers of the urban scene for many years; 
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that the struggle to create protected places and communities was as fiercely 
fought in urban areas as its evident loss was felt in rural areas.” 


Ethnicity and Community 


It may come as a surprise that research on ethnicity and community should 
be named as one of the strengths of urban political economy. Surely this is 
the centerpiece of the social organization paradigm, the focus of classics from 
Drake & Clayton’s (1945) Black Metropolis to Kornblum’s (1974) Blue Collar 
Community. I agree with the latter point but suggest that the new urban 
sociology has added a vital ingredient to the study of ethnicity and community 
by locating those local processes within the larger economy, labor force, and 
political environment, and by integrating these levels. That is, beyond 
employing these broader forces simply as background, the new work attempts 
to follow structural influences into the processes of ethnic identity formation 
and community action. 

Earlier work often understood this idea. An important, 1f seldom noted, 
theme in Gan’s (1962) study of Italians in Boston was that ethnic identity was 
becoming submerged in class consciousness. In a related vein, Yancy et al 
(1976) argued persuasively that ethnicity was "emergent," a product of the 
situations in which nationality groups found themselves as vulnerable minor- 
ities, rather than some intrinsic identity or essential cultural trait carried by 
immigrants. It was particularly in the urban setting that stratified labor 
markets, segregated housing patterns, and powerful political machines all 
combined to create new ethnic identities for groups introduced to the city in 
different times and conditions. 

Using the case of Koreans in Los Angeles, Light & Bonacich (1988) expand 
this approach into a comprehensive model for the study of ethnic minorities. 
The group's migration history is first analyzed to determine education, skills, 
reasons for moving, and how the timing of the sojourn is affected by conditions 
in Korea and Los Angeles. Locational preferences in the United States are 
examined as a product of past migrations and enclave settlements. Korean 
immigrants are then located within the Los Angeles labor and housing markets 
at the time of their arrival. The intersection of these factors helps to explain 
how a close-knit Korean community took root in the southwest and south 
central area of the city, a community based substantially on small retail and 
service businesses. These ethnic enterprises flourished by employing family 
labor, working long hours at low wage rates, and retailing the goods and 
services of the major US corporations (e.g. in gas stations, fast-food outlets, 
grocery stores) in new and expanded markets in their own and adjacent African 
American neighborhoods. Light & Bonacich conclude that ethnic business 
actually provides "cheap labor" for the major national firms. Small businesses 
are interpreted as labor in the sense that Korean immigrants exploit themselves 
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and their extended families in the process of creating markets that would not 
otherwise exist unless someone were willing to maintain retail outlets in ghetto 
areas, or to work all hours of the night in high crime areas for small profits. 
In the process, moreover, Koreans attract the resentment of African and Latin 
Americans who may lack the capital or experience to operate a business and 
who see Koreans as exploiting them. Indeed, during the 1992 Los Angeles 
riot, “more than two thousand Korean-owned businesses were destroyed or 
damaged . much evidence suggests, moreover, that the Korean merchants 
who own and run a large number of the liquor stores in South L.A. were 
selected as special targets of violence." (Rutten 1992:52). In addition to 
describing the family and associational life of the Korean community in Los 
Angeles, the Light & Bonacich study provides a profound explanation for 
urban conflict. 

The best-known recent book on urban minorities is certainly William Julius 
Wilson's (1987) The Truly Disadvantaged. Wilson would probably not 
consider himself primarily an urban sociologist, and his work is not explicitly 
anchored in the urban political economy tradition. Nevertheless, Wilson's 
analysis of the urban underclass follows a similar approach, moving from 
structural changes 1n the broader economy to local social effects. Blacks and 
Hispanics migrated to the cities in great numbers just as the urban economy 
was shifting from manufacturing to services. Blue-collar jobs that absorbed 
earlier waves of European migrants were being replaced by less stable or 
rewarding jobs in services such as eating and drinking establishments, health 
care, and retail stores. The new urban minorities faced shrinking employment 
opportunities, repeated sharp recessions from the 1970s to the early 1990s, a 
decline in neighborhood institutions such as schools and churches, and 
growing economic competition among themselves (e.g. between Koreans and 
African Americans). The changes conform to Sassen's description of social 
polarization in large cities and its economic causes. The result, according to 
Wilson, is a steady undermining of social institutions and the creation of a 
disadvantaged underclass. 


Social and Political Movements 


Ás something of a social movement in its own right, urban political economy 
was born in the midst of contention during the late 1960s and has always 
taken collective action as its metier. Castells’s early work addressed social 
movements in Paris (Pickvance 1975), and his research evolved within a few 
years to the point that he abandoned structural Marxism and the issue of 
collective consumption in favor of popular movements as the key agent in 
urban change (Castells 1983). 

In the United States, Piven & Cloward (1977) analyzed poor peoples' 
movements with an explicit political economy which attributed periods of 
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success in the labor, civil rights, and welfare rights movements to the effect 
of urbanization on shifting political coalitions at the national level. Indeed, 
their pessimistic, even prophetic, conclusion was that popular movements 
enjoyed only brief windows of opportunity to influence policy and were best 
advised to battle disruptively for whatever they could get before the coopting 
structures of politics enveloped them again. Norman and Susan Fainstein 
(1974) writing at the same time developed a kindred approach based on the 
structure of power and the volatility of insurgent challenges. Their focus was 
specifically urban, analyzing movements for community control, planning, 
schools, housing, and so forth. 

Much of the work in the field of social movements, of course, has flourished 
under the influence of resource mobilization theory (Jenkins 1983) which, 
despite Charles Tilly’s formative contributions, has not dealt uniquely with 
urban issues. A new book by Margit Mayer (1992), however, develops an 
important approach based on the state and social movements. Comparing 
urban conflicts in the United States and Germany, Mayer argues that the 
timing, resources, and prospects for success of movements derive in large 
part from the political environment created by the state. 


CRITICAL REFLECTIONS 


The success of urban political economy lies in its capacity for structural 
explanations of a variety of urban conditions. The structural logic of capitalist 
accumulation, for example, requires an expanding metropolitan market linked 
by transportation and infrastructure as well as an adequately housed labor 
force within economical commuting distance of factories and offices. These 
requirements embody social class contradictions (e.g. over land use, rent, 
access to public services) that occasion social struggles (e.g. between capital 
and labor or between property and industrial capital). State mediation of these 
conflicts follows the contours of political power that describe each conflict. 
By way of analysis, this imposing framework is projected on to the physical 
and social landscape. Assumptions about causality and the potency of various 
elements of the framework are prefigured in the structural argument. Action 
flows along one of several predictable courses depending on particular 
combinations of antecedent structural conditions. Social movements hatch 
from these structural contradictions, not from other sorts of contradictions 
independent of the framework and not from non-contradictory conditions of 
urban life. Consciousness, too, is a reflex of structural arrangements. In short, 
the very success of urban political economy begins to suggest its limitations. 
Densely developed theories of structural causation have laid to rest earlier 
anecdotal (community study) and metaphorical (human ecology) treatments 
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of the city, and in the process may have laid themselves in a new procrustean 
bed. 

Despite its impressive accomplishments, or because so many of them go 
back a decade and more, urban political economy seems to be flagging of 
late. Work on global cities and the transition from socialism is thriving, but 
research in other areas has become repetitive. 

Two problems have overtaken the field. First is the problem of overconfi- 
dence; the accumulation of a sufficiently impressive body of theory and 
research that practitioners prefigure answers to new questions. A cocksure 
analytical style begins to enter the writing in which developed theory is used 
implicitly to "read off" interpretations for empirical events. A genuine sense 
of puzzlement is seldom encountered. Second is the problem of economism: 
the tendency for political economy to become enamored of the seeming causal 
potency of economistic analyses—to collapse complex social issues into 
elusively precise technical and organizational terms. 

Ray Pahl (1989:709) has made a similar criticism about the "inevitable 
danger in any field that after the initial conceptual push many may feel inclined 
to embrace the new paradigm and doze a little." Pahl adopts from Giddens 
(1979) a model for analyzing society that explores the connections between 
structure, consciousness, and action—the SCA model. The trouble with urban 
sociology these days, he claims, is that it has stopped looking for these links, 
preferring instead to plunk down prefabricated concepts like such as social 
class in which the connections, for example, between location in the social 
structure and interest-based action is presumably given. 


My contention has been that scholars have behaved as if the links in the SCA 
chain were self-evident. Once it is understood that such links have yet to be 
discovered an important programme of research designed to advance theoretical 
understanding can be opened up. From informal social networks through families, 
kinship links and the whole range of formal and informal associations of civil 
society people are engaging 1n voluntary solidaristic and collective activity for a 
variety of goals. (Pahl 1989:719) 


Pahl's confidence in the SCA model is no doubt well informed, although 
other researchers might wish that Giddens had demonstrated its empirical 
mettle. The idea, nevertheless, resonates strongly with those of us who have 
become weary of the hypostatizing habits of political economy. In my own 
recent work, for example, I have developed a framework for explaining social 
change based on the interaction of state, culture, and collective action (Walton 
1992). The analytical aim is basically that of Pahl's suggestion—to show how 
action is based on meanings constructed under conditions of choice and 
constraint. 

Whether political economy finds a second wind in these critical recommen- 
dations or is replaced by some new paradigm are questions beyond our control. 
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New syntheses based on socially constructed value conflicts in tandem with 
an expanding empirical terrain suggest that the new urban sociology will be 
with us for some time to come. But the reckoning has already begun. Culture 
and action will guide the next generation of research. 
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Abstract 


Macrocomparative research has a broad reach within the discipline. We review 
recent books and articles engaging in comparison, either of nations or of 
structures and processes within two or more nations. We focus on how 
macrocomparative sociologists address the threats to validity which surround 
sample selection, research design, and data and measurement. Nearly all the 
studies we investigate have explanation as a goal. A major threat to many 
studies (almost a third) is the failure to apply any method of control. Most 
authors wish to draw general conclusions, but problems of sampling raise 
questions about their ability to do so. Researchers agree that measurement is 
problematic. However, issues surrounding measurement are largely neglected. 
We describe these and more specific empirical findings regarding the practice 
of macrocomparative research. 


INTRODUCTION 


Interest in methods of macrocomparative research is at a high point. Articles 
devoted to the topic have appeared in recent issues of the major sociological 
journals (e.g. Lieberson 1991, Kiser & Hechter 1991); Kohn’s 1987 ASA 
presidential address focused on it; and several recent works have advised 
sociologists on the practice of the craft (Ragin 1987, Skocpol & Somers 1980, 
Tilly 1984). Shifting our attention to works that apply macrocomparative 
methods, the centrality of these methods and the reach of macrocomparative 
research is even more impressive. Between 1985 and 1990, over 200 nonedited 
books in this tradition were reviewed in Contemporary Sociology. Over the 
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same period, more than 80 macrocomparative research articles appeared in 
top sociology journals. 

We credit this emphasis to diverse factors. The collapse of communism in 
the former Soviet Union and its satellites, the trend towards democratization 
worldwide, the continually growing political and economic importance of the 
Pacific rim, and an increasing awareness of the interdependence of nations— 
all challenge sociologists to think about social change at the macro level. 
Within sociology, the rise of world system and dependency theory has 
heightened interest in macrocomparative research, and the resurgence of 
interest in comparative historical sociology has reinforced this impact. Also 
relevant is the explosive growth of international data series produced by the 
United Nations, the World Bank, and others. The internationalization of 
sample surveys has also furthered interest in cross-national comparison. The 
World Fertility Survey was an early effort along these lines. Other, more 
recent examples include the European Values Survey, the Demographic and 
Health Surveys, the international counterparts to the US General Social 
Survey, and the Living Standards Measurement Surveys. Taken together, 
these factors have led sociologists from disparate orientations to a common 
concern with methods of macrocomparative research. 

Our assigned task is a critical review of how sociologists recently have 
gone about comparing nations, or structures and processes within two or more 
nations. We do not restrict our examination to "classic" works in the field. 
This differs from the common strategy of critically examining only a few 
well-known studies. To the extent possible, we focus on the literature as a 
whole and comment on its strengths and weaknesses. Individual studies are 
identified mostly to illustrate particular points. No study is perfect, and indeed, 
our own contributions to the literature are subject to some of the same 
criticisms we raise. Our hope is to inspire a reexamination of methodological 
practices ın the field generally, just as the process of conducting this review 
has caused us to rethink some of the assumptions underlying our own work. 

When we agreed to undertake this review, our initial idea was to form a 
typology, an honored tradition (e.g. Przeworski & Teune 1970, Vallier 1971, 
Smelser 1976, Elder 1976, Skocpol & Somers 1980, Tilly 1984, Jackman 
1985, Ragin 1987, Kohn 1987). However, with its emphasis on categories 
and qualitative differences, for us the typology became a Procrustean bed. 
The quantitative-qualitative distinction so familiar to sociologists also turned 
out not to characterize the literature in any meaningful way (also, see Smith 
1991). Although the typologies shape our discussion, our review reveals much 
more continuity than discontinuity in the literature, much more similarity than 
dissimilarity. 

Rather than emphasize differences, we review and assess responses to a 
common set of methodological problems, focusing on threats to valid 
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inference (cf Cook & Campbell 1979). Regardless of style of analysis, every 
study must consider the possibility that some process other than the one 
hypothesized accounts for the results obtained; that measures are not valid, 
reliable, or comparable; and that case selection influences conclusions. Failure 
to consider these issues can undermine the value of macrocomparative 
research. We describe the recent literature in terms of how authors respond 
to these threats. In anticipation of our findings, a disturbingly large proportion 
of the studies examined inadequately treat these issues. 

The next section explains our selection of studies for this review. It sets 
boundaries to the literature we include and limits the generalizations we can 
make. It also demonstrates the broad reach of macrocomparative research 
within the discipline. The sections that follow consider sample selection, 
research design, and data and measurements in the studies we have selected. 
These sections identify commonalities in the practice of macrocomparative 
research, methodological weaknesses, and problems deserving more attention. 
They are the heart of our critical review. In the conclusion, we consider 
potential shortcomings of our review, summarize its major points, and suggest 
areas where literature needs improvement. 


LITERATURE SURVEYED 


We define macrocomparative studies as comparing two or more nations.' The 
comparison may involve global, aggregate, or individual-level structure or 
process so long as it draws on the experience of more than one nation. Our 
definition is broad enough to include studies of the world system, in which 
comparison of nations is accomplished within the context of a global system 
of relations, and also of preindustrial societies, in which the unit of analysis 
is generally subnational in the modern sense of the term. À second criterion 
is that comparison of nations must be a primary purpose of the work. Research 
making only incidental comparisons not central to the analysis is excluded. 
À third criterion restricts our examination to comparative works by sociolo- 
gists, or works likely to be read by sociologists. We focus on the recent 
sociological literature, recognizing that many fine examples of macrocompara- 
tive research thereby will be omitted, including many “classics.” It would be 


I There 1s debate as to whether macrocomparative should be defined in terms of methods (e.g. 
Smelser 1976) or substantive focus (e g Ragin 1987). Authors also vary in how broadly or 
restrictively they define the topic. For example, Przeworski & Teune (1970) take the narrowest 
approach when they limit comparative research to studies that compare microlevel relationships 
in two or more social systems. With respect to the definition we propose, the emphasis on 
comparison of nations is somewhat restrictive since macrocomparative research may compare 
structures and process within as well as between nations (e.g. B1esel 1990) We retain the focus 
on nations to make our task more manageable 
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ideal to know the universe of works meeting our criteria. Macrocomparative 
research appears in a myriad of journals in sociology, history, economics, 
and political science as well as in books catalogued under a variety of topics. 
We know of no way to locate all relevant papers and manuscripts by and for 
sociologists. We must sample. 

We developed separate strategies to identify and sample macrocomparative 
research published in books and journals. Indeed, there may be interesting 
methodological differences in the work published in these two forms. We 
compiled our list of macrocomparative books by taking all nonedited volumes 
reviewed in Contemporary Sociology (CS) between 1985 and 1990.” CS is 
the major book review journal in the discipline and reviews most macro- 
comparative books of interest to sociologists. Clearly, there will be omissions 
and some books of little interest will be included, but we can think of no 
better single way to develop a list of relevant books to represent the diversity 
of styles of macrocomparative research now current in the discipline. Our 
search identified 209 nonedited books on which we gathered some basic 
characteristics. It was impossible, however, to read so many books to obtain 
detailed information for our critical review. We sampled again, concentrating 
on two substantive areas in which we had prior expertise and which were the 
subjects of a large number of books: demography and development. In these 
two areas, we read every book on our master list and classified the research 
described in each in terms of sampling, design, and measurement.? We make 
no claim that works in demography and development are representative of 
the literature more generally. When possible, we assess the sensitivity of our 
results to the exclusion of works in other areas. 

Our list of macrocomparative articles includes every macrocomparative 
piece appearing in the American Sociological Review (ASR), American 
Journal of Sociology (AJS), and Social Forces (SF) between 1985 and 1990 
and every paper by a sociologist published in Comparative Studies in Society 
and History (CSSH) for the same period. The first three are top journals in 
the discipline; they publish research in diverse theoretical and methodological 
traditions, and they have wide circulations. However, we were concerned that 
focusing only on these journals might favor certain kinds of pieces (e.g. 
quantitative studies). We added CSSH, an interdisciplinary journal that is an 
outlet for some macrocomparative sociological research, for this reason. Based 


^The restriction to nonedited books resulted from our inability to gauge macrocomparative 
content from the reviews of most edited volumes and also from our belief that, in general, refereed 
journals represent a better source of high-quality, chapter-length studies 
alternative was to use probability methods to sample from the master list We suspected 
that research styles might differ according to topical focus, however, and to explore this 
possibility, we needed sufficient works within each topical area to make meaningful comparisons. 
There was less topical variation than expected 
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on our impressions and informal discussions with colleagues, CSSH emerged 
as the single additional journal of central interest to macrocomparative 
sociologists, particularly historical sociologists. It was not always possible to 
tell if a contributor to CSSH was a sociologist, so we restricted our attention 
to authors listed in the membership directory for the American Sociological 
Association. 

So our critical review covers all nonedited macrocomparative books 
reviewed in CS between 1985 and 1990 (with particular attention to those in 
development and demography), all macrocomparative articles appearing in 
ASR, AJS, and SF during the same period, and all such papers by sociologists 
in CSSH. Note the implicit restriction to publications in English (or translations 
to English). Without knowing the universe, we cannot make objectively based 
claims on the relation of the works on our list to current practices in the field. 
Nevertheless, we designed our procedures to achieve broad coverage; our list 
includes works that exemplify all major approaches to macrocomparative 
sociology; and general statements based on it will be of interest—indeed, 
many countered perceptions of the literature that we shared before embarking 
on this project. f 

We identified 294 papers and books with a macrocomparative thrust. Table 
1 classifies each in terms of major topic. The most popular areas are political 
sociology, development, cultural sociology, and demography. There is more 
macrocomparative work in these subfields than in all other subfields com- 


Table 1 Macrocomparative works classified by major topic* (N = 294) 


Books Articles Both 
Topic (96) (96) (96) 
Political sociology 30 26 29 
International development 10 21 13 
Economic sociology 10 2 7 
Sociology of education 2 l 2 
Stratification 8 12 9 
Work and organizations l 7 2 
Cultural sociology 14 13 14 
Race and ethnicity 5 1 4 
Urban sociology 3 4 3 
Criminology and sociology of law 6 6 6 
Demography 12 7 11 


* Note: Sample consists of all nonedited books reviewed m CS, all articles published 
in ASR, AJS, and SF, and ali articles by sociologists in CSSH. 


‘Given space constraints, we cannot list the 294 papers and books here; the list can be obtained 
on request from the authors 
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bined. As we noted, the relative importance of development and demography 
as topics for macrocomparative research is one reason why we chose to focus 
on these topics in our in-depth examination of books. However, the more 
significant message of Table 1 is the reach of macrocomparative research in 
the discipline. The topic areas presented in this table are aggregates, the 
aggregation informed by shared views of the taxonomy of sociological 
subspecialties. Research on the political economy of the welfare state, for 
instance, is usually seen as falling into the domain of political sociology (not 
economic sociology). Similarly, research on the sociology of the family is 
usually linked with demography. Remembering that the major topic areas 
presented in Table | express this high level of aggregation, it is difficult to 
think of topics not influenced by research in the macrocomparative tradition. 


SAMPLE SELECTION 


Sample selection procedures shape the approach authors take to macro- 
comparative analysis, and create its strengths and weaknesses. This section 
demonstrates this claim in a critical review of sample size, composition, and 
method of selection. We begin with sample size and composition, and because 
we coded these characteristics for all nonedited books reviewed in CS, we 
base our comments on the entire set of macrocomparative books and articles 
identified. The discussion of sample selection again draws on all articles, but 
the in-depth examination of books is restricted to those in development and 


demography. 
Sample Size 


Number of cases is a basic characteristic, with implications for the macro- 
comparative methods employed as well as the kind of conclusions that can 
be drawn. Some writers prefer the greater depth of knowledge that may 
accompany a sample with fewer cases. For example, Tilly writes: 


On the whole, comparative studies of big structures and large processes yield 
more intellectual return when investigators examine a relatively small number of 
instances. This 1s not because of the intrinsically greater value of small numbers 
but because large numbers give an illusory sense of security. With small numbers, 
the student of a structure or process has little choice but to pay attention to the 
historical circumstances and particular characteristics of the cases at hand and 
thus to work harder at meeting the commonsense conditions of effective 
comparison (1984: 76). 


Others point out that because comparisons geometrically increase as cases 
increase, the application of case-oriented methods (i.e. Mill’s method of 
agreement and indirect method of difference) requires a relatively small 
number of cases (Ragin 1987: 50). 
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Some authors dispute the desirability of a small sample, however. 
According to Lieberson: “If data were available with the appropriate depth 
and detail for a large number of cases, obviously the researcher would not 
be working with [only a] few cases” (1991: 308—309). Lieberson also argues 
that choice of case-oriented methods is driven by sample size rather than 
vice versa (1991: 318). Other problems with small samples include limited 
"degrees of freedom” (Jackman 1985, Przeworski & Teune 1970) and the 
possibility that error in measurement has a greater impact (Lieberson 1991). 
A small number of cases might also be thought less adequate than a large 
sample for drawing general conclusions (e.g. Lieberson 1991: 311), although 
the nonrandom procedures typically used to select large samples lessen the 
importance of this difference. 

Differences of opinion about small versus large samples take for granted 
differentiation along this dimension. Indeed, before examining the literature, 
we shared this perception. We expected that sample size in macrocompara- 
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Table 2 Number of cases examined in macrocomparative research* 


(N = 294) 

1-2 cases 3-5 cases 6—20 cases >20 cases Total 
Media (96) (96) (96) (36) (96) 
Books 20 22 33 25 100 
Articles 26 19 13 42 100 
Both 22 21 27 30 100 


"Note Sample consists of all nonedited books reviewed in CS, all articles published 
in ASR, AJS, and SF, and all articles by sociologists in CSSH 


tive research would be bimodal, with one “hump” for studies based on a 
small number of countries and another “hump” for large N studies. Figure 
1 presents the distribution of sample size in the articles reviewed.” The 
expected bimodal pattern is not evident in this histogram. Rather, the 
distribution of sample size has a right skew, with the greatest number of 
studies having two cases and trailing off fairly evenly after that. We do 
not present a detailed histogram for the books reviewed in CS because it 
was not always possible to tell from the review the exact number of countries 
on which a particular study is based. Instead, we assigned each book-length 
study to a sample size category (1-2, 3-5, 6-20, more than 20). Table 2 
reports the distribution of books given these categories, and also, for 
comparative purposes, the articles. 

It is not accurate to describe macrocomparative research in terms of a 
dichotomy of large and small samples. What cut-point would one choose? 
Looking at Table 2, the distribution of studies across sample size categor- 
ies 1s relatively even. Over 40% of the books and articles are based on 
five or fewer cases. Books are more likely to be based on samples of 
6—20 cases, while articles are more likely to be based on greater than 20 
countries. 

However, there are differences by topic. Most works in race/ethnicity, 
stratification, and work/organizations are based on five or fewer cases. 
Samples in the 6 to 20 range are most likely in education, demography, and 
political and cultural sociology, but such samples never constitute the modal 
category. They are least likely in the areas of work/organizations and urban 
sociology. Samples larger than 20 are the norm for books and articles in 
development and demography. This is an important fact to note: It means that 
macrocomparative research based on large samples is overrepresented in our 
in-depth examination of book-length studies within the restricted areas of 


"Studies with “1 case are those that treat nations as part of a single world system or those 
considenng a single region as a whole 
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demography and development. Of course, the in-depth examination of journal 
articles is not restricted in this way.° 


Sample Composition 


We also began with the mistaken notion that most macrocomparative studies 
focus either on Western industrial nations or on developing nations in the 
semiperiphery or periphery. Table 3 shows the percentage of studies with 
samples restricted to the industrialized West, the (until very recently) 
centrally planned industrial economies of Eastern Europe, the Third World, 
and studies with samples drawn across these divisions of the world. Included 
are all books reviewed in CS as well as all journal articles. It is true that 
the former Soviet Union and its satellites are only rarely the focus of 
macrocomparative research; we hope this situation will change as these 
countries open up and more data become available. However, it is not true 
that the large majority of studies focus either on the industrial West or on 
the Third World. The modal study in our sample, whether published as a 
book or an article, draws on materials from a heterogeneous mixture of 
cases from around the world. 

It may also surprise some readers that studies do not always identify cases. 
This comment is based on an assessment of books on development and 
demography, for which we collected detailed information, and on all of the 
articles. Over 10% did not list the countries in the analysis. This is more 
common in articles (16%) than books (6%), possibly because of space 


Table 3  Political-geographic character of the cases examined" 


(N = 294) 

Books Articles Both 
Focus (%) (%) (96) 
First World 39 34 38 
(Former) Second World 2 0 l 
Third World 15 24 17 
Mixed 44 42 44 
Total 100 100 100 


“Note Sample consists of all nonedited books reviewed m CS, all 
articles published in ASR, AJS, and SF, and al] articles by sociologists in 
CSSH 


°We also found that about half of the articles in ASR and SF were based on greater than 20 
cases; articles in AJS were spread fairly evenly across sample-size categories, and the large 
majority of those in CSSH utilized fewer than six cases. 
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constraints.’ For example, Fiala & Lafree (1988) analyze child homicide in 
a sample of 40 developed and developing countries but only identify the 
developed countries in the sample. The developing nations are not listed or 
otherwise defined. Consequently, it is impossible for the reader to evaluate 
the generality of their results (since sample composition may affect the 
outcome). As part of minimum standards for documentation (cf Samara & 
Entwisle 1992), authors should, for this reason, identify the countries on 
which their research is based. 

As a third point, there is no guarantee that the case base is consistent 
throughout an analysis. Consistency is found only two thirds of the time and 
is somewhat less likely in journal articles (6496) than in books on development 
and demography (74%). Sometimes, authors bring in data from a more 
restricted source as their investigation unfolds, and this can change the sample 
composition (e.g. Messner 1989). Other times, authors use different data 
sources and samples to construct or evaluate different parts of their argument 
(e.g. Easterlin & Crimmins 1985). In some cases, the reasons for changes in 
a case base are unclear to the reader. In light of these practices, it bears 
restating that shifts in sample composition may affect the results of an analysis 
and stand as a competing explanation for the findings. 


Method of Selection 


How authors conceive of the universe of cases, and the procedures used to 
select cases for analysis, can be integral to the research design and has key 
implications for the conclusions that can be drawn. The CS reviews did not 
always report how samples were selected. Therefore, we restrict our comments 
in this section to books in demography and development and all of the articles. 
In our sample, the overwhelming majority (8896) of authors wish to draw 
general rather than historically specific conclusions, mostly about the opera- 
tion of structural forces in the context of long-run social change.” Perhaps 
this is why so many do not identify explicitly a group of countries or a span 
of time to which their findings would relate. Without a clearly defined 
universe, however, it is difficult to judge the generality of the results. 
Sample selection procedures are equally important. Surprisingly, not all 
authors explain why particular cases were chosen for analysis. Among those 
that do, the major reasons given are familiarity with certain cases, similarity 
along one or more dimensions, heterogeneity, particular values of the 


"The journals vary, with papers in CSSH (100%) and ASR (90%) more likely to provide details 
on the case base than papers in AJS (79%) or SF (75%). 

5 Again, we see some variation in joumals, with CSSH (89%), AJS (79%), and ASR (7196) 
higher than SF (4096). 

?Many of the exceptions appear in CSSH Almost half of the articles published in this journal 
have historical specificity as their goal. 
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dependent variable, data availability, or some combination of these.'? See 
Table 4, which provides information for nonedited books in demography and 
development and for all of the journal articles. Few of these works use 
probability methods, and most use them to select units within countries rather 
than to select the countries themselves (e.g. in a comparative analysis of 
several national surveys). Those few who employ probability methods to select 
societies (e.g. Leavitt 1986) draw their samples exclusively from predefined 
universes of premodern societies such as the Human Relations Area Files 
Quality Control Sample Universe (HRAF 1967) or the Standard Cross Cultural 
Sample (Murdock & White 1969). Most macrocomparative sociologists, 
however, tend to work with samples of modern nation-states. As a result, 
despite the interest in general conclusions, the degree to which the findings 
of most macrocomparative research can be generalized is, at best, unknown. 
We return to this point after discussing the contents of Table 4 and the 
advantages and disadvantages of particular sample selection strategies. 

The majority of studies select their samples based on characteristic(s) held 
in common by two or more countries. Included are studies that focus, for 
example, on Western industrial nations or countries in the Third World (cf 
Table 3). Other criteria of similarity include geocultural region (Bradshaw 
1985, Eckstein 1986, Evers 1987), network membership (Meyer 1986), 
hegemonic status (Kiser & Drass 1987) and even musical tradition (Cerulo 
1989). At least potentially, a focus on similar nations can hold constant some 
confounding variables that otherwise might be difficult to control. Countries 


Table 4 Sample selection criteria used in recent macrocomparative 
research* (N = 119) 


Books Articles Both 
Criteria (96) (96) (96) 
Famtiliarity 15 4 7 
Similarity 85 51 61 
Heterogeneity/deviant values 29 25 26 
Values of the dependent variable 15 13 14 
Data availability 24 34 31 
Probability sampling 6 9 8 
Other 6 15 13 


* Note: Sample consists of all nonedited books in development and demography 
reviewed in CS, all articles published in ASR, AJS, SF, and all works by sociolog- 
ists in CSSH Columns do not sum to 10096 because some studies employ multiple 
sample selection criteria 


Case selection may also be justified for purposes of illustration (e.g. Yamaguchi 1987, 
Johnson & Earle 1987). replication (e g. Hauser & Grusky 1988, Bollen & Jackman 1985, Gray 
1987), and historical interest (Baer et al 1990). 
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seldom seem similar enough to yield this benefit, however. Further, the 
generality of results based on countries selected in this way is unclear, a point 
rarely mentioned in the literature. For example, samples often are limited to 
large countries, but it is rare to see this restriction justified on substantive 
grounds or resulting inferences limited to macrostates. 

Data availability is another common factor in sample selection, relevant to 
the samples analyzed in over a quarter of the studies, especially those based 
on large samples.!' Though unavoidable, data problems lead to nonrepresen- 
tative samples. For instance, reliance on World Bank sources typically limits 
the sample to countries with populations that exceed one million, excluding 
microstates in Oceania and the Caribbean. More data are missing for former 
communist societies and for developing countries than for Western industrial 
nations. 

With these concerns in mind, we also examined the manner in which studies 
handled missing data within the samples that they defined. Problems with 
missing data were discussed in 4096 of the books and articles examined. In 
over two thirds (6796) of such instances, the author's response was to omit 
cases exhibiting missing values. Pampel & Park's (1986) study of the 
cross-national determinants of female retirement is typical. After defining a 
sample of 117 nations (those with populations greater than one million), they 
omit cases with missing values on a number of independent variables, leaving 
a sample of 93 nations. Other options were less popular. Missing values were 
estimated in 24% of the studies discussing missing data. In an analysis of 
trends in colonization, for instance, Boswell (1989) responds to missing values 
in a time series by estimating them with a linear interpolation. Researchers 
rarely drop variables in response to missing data; this occurred in 9% of the 
studies discussing missing data. Studies such as Pampel & Tanaka's (1986) 
analysis of economic development and female labor force participation are 
thus unusual. Faced with the common problem of missing data for the former 
communist societies, they respond by dropping variables exhibiting missing 
values and focusing a portion of their analysis on samples which include these 
nations. These alternatives (estimation and dropping variables) were twice as 
likely to be taken up in books (50%) as in articles (26%). To be sure, each 
of these options involves trade-offs. But systematic distortions in sample 
composition in macrocomparative research occur because of data limitations. 
More attention needs to be given to the impact of nonrepresentative samples 
in such work. 

A significant minority of authors select their samples to emphasize 
heterogeneity. Diversity enhances variation on the independent and dependent 
variables, and it also means that the sample will represent a wide range of 


Data availability ıs a factor in 81% of the studies based on greater than 20 cases 
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experience. Thus Sasaki and Suzuki (1987), for instance, select the United 
States, Holland, and Japan for their analysis of changes in religious commit- 
ment because they see these nations as exemplars of three forms of industrial 
society—the American, the European, and the Japanese—and of two forms 
of religion—the Western and the Eastern. Similarly, Burke & Lubeck 
(1987:663) select Nigeria and Iran for their study of social movements in 
oil-exporting nations because these nations "define something like the extent 
of variation." The language gets slippery, though, since a sample representing 
a wide range of experience is not a representative sample. How far results 
based on the former can be generalized is an open question. Burke & Lubeck 
(1987:663) caution that, as a consequence of their sample selection proce- 
dures, “meaningful generalizations are difficult to discern." 

Ragin (1987) and Tilly (1984) argue for methods that force the investigator 
to become intimately familiar with specific cases. We wonder how often it 
goes the other way, that is, that analysts choose cases with which they are 
already familiar. Prior familiarity was a stated reason for selecting particular 
cases in 15% of the books and 4% of the articles. As authors do not always 
fully describe personal and professional reasons for choosing particular study 
sites, these figures are likely underestimates. We know that prior familiarity 
played a role in several instances where this was not explicitly acknowledged. 
Indeed, prior familiarity can have its advantages. In Only Mothers Know 
(Raphael 1985), anthropologists with field experience who were returning to 
particular settings for their own purposes were recruited to add to their research 
agenda some observation of breastfeeding. Familiarity enhanced the validity 
and reliability of the data collected. 

Some samples are chosen based on values of the dependent variable. For 
example, implementation of Mill's method of agreement requires identifica- 
tion of all cases having a particular phenomenon in common. The goal is to 
infer cause from other characteristics held in common. The method of 
difference requires that cases in which the phenomenon is absent also be 
included in the comparisons, with the expectation that the cause will be absent 
in those additional cases (Ragin 1987, Skocpol & Somers 1980). Lachman's 
(1989) analysis of elite conflict and state formation in early modern England 
and France provides an example of the latter, although the author never says 
so explicitly. Lachman contrasts the different types of absolutism that he sees 
in these two settings, viewing each as both consequent and determinant of 
conflict among the crown, clergy, and gentry. In England, the success of the 
Reformation (the victory of the crown over the clergy) produced a “vertical” 
form of absolutism, which led ultimately, to the defeat of the crown in the 
Civil War. The opposite chain of defeat and victory is seen in France, along 
with a “horizontal” rather than “vertical” form of absolutism. Sample selection 
is an integral part of the research design, and the selection is on the dependent 
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variable (e.g. outcome of elite conflict). This is risky from the standpoint of 
causal inference, however—a point well known in the statistical literature 
(Heckman 1979, Berk 1983), but relevant to comparative research as well 
(Geddes 1991). 

An underutilized but potentially very useful strategy would be to select 
deviant cases for examination—i.e. to investigate countries that do not 
conform to widely held expectations about structure or process. One possibility 
would be to choose outliers in statistical studies and subject them to intensive 
qualitative analysis to gain insight into why these countries differ. This 
strategy would capitalize on the strength of statistical methods to establish 
which countries conform and the strength of other methods to provide a 
detailed accounting of the reasons for the departure. This marriage of 
approaches is rarely seen in the literature, though.’ Macrocomparative 
Statistical work often does not apply outlier diagnostics to identify cases not 
well explained by the model. And claims about the deviant status of countries 
in other macrocomparative work are not generally informed by the results of 
such analyses (when performed). 

In sum, many aspects of sample selection in macrocomparative research 
are problematic. Although most researchers wish to draw general conclusions, 
the degree to which they can realistically hope to do so is unclear. Cases 
arbitrarily enter and exit the sample. Studies rarely draw a probability sample 
of countries or time points. However, statistical significance tests might still 
be justified in terms of superpopulations where the observed sample is treated 
as a single realization of the possible samples that could be drawn from a 
hypothetical population. Alternatively, the rationale for significance tests can 
be based on analyzing causal mechanisms with a stochastic element. These 
arguments are controversial in many statistical applications (Freedman & Lane 
1983, Blalock 1972:238—39), however, and the issue deserves more attention, 
especially in macrocomparative research where few investigators draw prob- 
ability samples from a clearly defined population. Although drawing random 
samples is the best way to ensure representativeness, because of data 
limitations we expect that probability samples of countries will continue to 
be the exception rather than the rule. 


RESEARCH DESIGN 


By design, we refer to strategies used by researchers to untangle cause and 
effect and to enhance the internal validity of their work. Some contributors 
to the methodological literature see an understanding of cause and effect to 
be the central goal of macrocomparative research and organize their typologies 


For an example of this type of work, see Form & Bae 1988. 
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accordingly. Smelser (1976) views all comparative methods as having in 
common the attempt to control variation, although the methods to do so vary. 
Jackman (1985) also sees macrocomparative methods, whether qualitative 
(comparative) or quantitative (statistical), as centrally concerned with causal 
relations among variables controlling for confounding factors. Ragin (1987) 
states the goals of case-oriented (or qualitative) investigation to be causal 
analysis and historical interpretation. He sees the primary goal of the 
variable-oriented (or quantitative) strategy to be the testing of abstract 
hypotheses drawn from general theories. 

However, not all research designs have untangling cause and effect as their 
central goal. The typology proposed by Skocpol & Somers (1980) includes 
two alternative approaches. The first is parallel demonstration of theory, which 
refers to a case illustration or the illustrative application of a theory to two 
or more cases. The theory may implicitly or explicitly treat causes and effects, 
but studies of this type are meant to illustrate a theory, not to independently 
establish a causal factor. Contrast of contexts, which attempts to identify the 
limits of existing theories, is the second approach identified by Skocpol & 
Somers that does not seek to establish a causal effect. Making causal 
inferences about macrolevel structures and processes is an explicit goal of 
their third category, labeled “comparative history as macro-causal analysis.” 
Tilly’s (1984) typology also includes a variant of macrocomparative research 
not centrally concerned with cause and effect: individualizing comparisons, 
which establish exactly what is particular about a specific historical experi- 
ence. Cause and effect are more relevant to his other categories—niversal- 
izing comparison, variation finding, and encompassing comparisons. 

Many of the issues we address below are most relevant to studies that have 
explanation of a structure or process as their goal. We therefore limit our 
critical review to studies with this goal. This limitation is not very restrictive: 
of the 119 studies examined in detail, only 6 books and 5 articles are excluded. 
For the remaining 28 books and 80 articles, we consider several aspects of 
design: the role assigned to chance; the advantages and disadvantages of 
cross-sectional analysis; method of control; and the degree to which authors 
consider alternative explanations in their interpretation of the results. 


Stochastic vs Deterministic Relations 


We begin by examining the role assigned to chance in the design of 
macrocomparative research, a subject of interest in the recent methodological 
literature (Ragin 1987, Lieberson 1991). Explanations that allow for chance 
deviations have two possible origins. One possibility is that the likelihood of 
some outcome is less than certainty because the model or explanation is 
incomplete. Variables known to be relevant may be omitted because they are 
unavailable. Or, there may be exogenous shocks to the system (e.g. unusual 
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events or persons). An alternative possibility is that causality is inherently 
probabilistic. Few authors express this view explicitly, though many would 
probably agree with it. In contrast, determinism is predicated on two 
assumptions, that all potentially determining variables are known and included 
and that relationships are not inherently probabilistic but rather, when a given 
set of causes is present, the outcome follows with certainty. 

Ragin (1987, p. 51) gives sustained attention to assumptions about the role 
of chance in his typology of macrocomparative research methods. He 
differentiates the variable-oriented approach, which is probabilistic, from the 
case-oriented approach, which is not. For the latter, given the logic of the 
method of agreement, and also the indirect method of difference, a single 
deviant case can cast doubt on a cause-effect relationship established on the 
basis of many observations. Lieberson makes a similar point, but from a very 
different perspective: 

The formal procedures used in the small-N comparative historical, and organiza- 

tional analyses under consideration are all deterministic in their conception. 

Indeed, small-N studies cannot operate effectively under probabilistic assumptions 

because they would require much larger N’s to have any meaningful results (1991: 

312). 

If Lieberson (1991) and Ragin (1987) are right, then the small-N studies 
are using a deterministic conception of causality that characterized the early 
physical science models. It is ironic that critics such as Tilly (1984) and others 
suggest that large-N analyses are overly influenced by physical science models 
when, at least on this trait, it is the small-N studies that draw heavily on old 
models from the physical sciences. 

Deterministic models are rare in the recent macrocomparative literature. In 
Table 5 all of the articles (100%) and virtually all of the books (94%) surveyed 
assign some role to chance.'? This is true even of those studies employing 
Mill’s methods! It is difficult to know what to make of this. Somehow 
determinism has been decoupled from other logical elements of the procedures 
applied in the analysis of relatively small samples. We do not want to make 
this criticism too sharply, though, for not all studies using Mill's methods 
explicitly acknowledge doing so. Consider Lachman’s (1989) analysis of elite 
structure and state formation. Stochastic elements enter into the argument in 
` several ways—-as a consequence, for instance, of the author's interest in 
general relations (applicable to cases outside those analyzed) and his rendering 
of a nonholistic account. These elements are inconsistent with the assumption 
of deterministic relations that underlies Mill's methods. We must temper this 


PSeavoy (1986) 1s the most clearly deterministic work. One other book leans in this direction, 
and although ambiguous, we coded it as deterministic (Smookler 1984). Among articles, only 
Stephens (1989) seems to come close, but in this instance we decided that chance factors did 
have a role to play. 
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Table 5 Elements of research design in recent macrocomparative research" 


(N = 108) 
Books Articles Both 
(96) (96) (96) 
Assumptions about Chance 
Relationships stochastic 93 100 98 
Relationships deterministic 7 0 2 
Cross-sectional or Longitudinal 
Cross-sectional 7 43 33 
Longitudinal 7 18 15 
Both 86 44 52 
Method of Control 
No controls employed ST 22 31 
Method of agreement 7 0 2 
Indirect method of difference 14 5 7 
Statistical control 29 70 59 
Other methods of control 0 5 4 


“Note: Sample ıs restricted to works having explanation as a goal Although un 
common, more than one method of control may be used within the context of a single 
study. 


criticism, however, since Lachman (among others) never explicitly states that 
Mill's methods are being employed. 


Cross-sectional vs Longitudinal Designs 


Much debate has centered on the advantages and disadvantages of cross-sec- 
tional designs. On one side is Jackman (1985, p. 171—73), who argues that 
cross-sectional data can be useful in testing causal hypotheses, especially 
when the research focuses on long-term changes not realized within the typical 
range of data available.!^ Longitudinal designs are of little help when variables 
vary more over space than time. On the other side is Tilly (1984), who sharply 
criticizes the use of cross-sectional designs in macrocomparative research. In 
Tilly's view, underlying this design is the assumption that history follows a 
general sequence of set stages, and he argues that: 


There is no logical connection between the sequence of changes in those variables 
followed by individual countries and the differences that show up in a cross-sec- 
tion... Although multiple regression and similar techniques will show which 
characteristics covary in linear fashion, that covariation is as likely to result from 
diffusion or from common structural position in a worldwide system as from any 
internal logic of development (1984. 35). 


"Even when time series data are available, cross-sectional data may better reflect long-run 
changes (e.g. Cassidy 1981). 
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His recommendation is to move directly to longitudinal evidence. A 
cross-section is no substitute for a time series (Tilly 1984, p. 37). 

Many cross-sectional studies do not justify their choice of strategy nor 
discuss its limitations or strengths. Overlooked are such questions as: What 
is the relation between variation in the cross-section and changes over time? 
What is assumed about the countries in the analysis? What are the implications 
of diffusion? On the other hand, criticisms of cross-sectional designs exhibit 
some misunderstandings. For instance, Tilly's assertion that cross-sectional 
analysis assumes that history follows a general sequence of stages is 
disputable. Different combinations of values of the variables in a cross-sec- 
tional analysis—decreases as well as increases——can produce the same value 
for the dependent variable. No form of "evolution" need be assumed. 

Both cross-sectional and over-time data are prevalent in the literature: 93% 
of the development and demography books and 86% of the articles include 
some cross-sectional comparison; 9396 of the books and 6196 of the articles 
include some longitudinal comparison. There are important differences 
between books and articles, however. Books are twice as likely to combine 
cross-sectional and longitudinal inference. Is this because space is less of a 
consideration? Or is there something in the review process for books as 
opposed to articles that favors some discussion of longitudinal data? Among 
journal articles, the modal design is cross-sectional. Almost half are based on 
cross-sectional comparison alone." 


Method of Control 


Method of control is central to many typologies of macrocomparative research. 
For example, Jackman (1985) identifies three approaches: experimental, 
comparative, and statistical. He notes that experiments are rare, and indeed 
our review of the recent literature turns up no examples of this approach. 
Ragin (1987) proposes a similar typology, grouping the method of agreement 
and the indirect method of difference under the heading of case-oriented 
research, distinguishing it from variable-oriented research based on statistical 
methods. Skocpol] & Somers (1980) also group Mill's methods, identifying 
them as forms of macrocausal analysis. However, their typology excludes the 
use of statistical controls. For Tilly (1984) the methods of agreement and 
difference are key to universalizing comparisons, whereas statistical methods 
are central to variation finding comparisons. A third category, labeled 
encompassing comparisons, is more ambiguous. Any of the forms of control 
might be used in this context. This method begins with a large structure (e.g. 
world system) and selects locations within the structure to explain similarities 


SCSSH is the exception, with 7 of 9 studies combining longitudinal and cross-sectional 
inference. 
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or differences among those locations as consequences of their relations to the 
big structure. One could select locations to conform to one of Mill's methods, 
employ statistical controls, or even use no controls. 

Table 5 reports the fractions of the 108 books and articles using different 
types of controls. The surprise is that over a quarter of the studies oriented 
toward explanation use no method of control. The absence of controls is 
particularly noticeable for books—over half of those reviewed fall in this 
category. To be sure, the large majority of studies with no method of control 
do include some discussion of alternative explanations (13 of 16 books, 14 
of 17 articles). But, from the standpoint of valid inference, consideration of 
alternative explanations alone is not the strongest basis from which to draw 
conclusions about cause and effect. 

Very few authors use the method of agreement or the indirect method of 
difference. The method of agreement is found only twice (Seavoy 1986, Jones 
et al 1986), and the indirect method of difference in only a handful of studies. 
In Ragin’s (1987) view, the method of agreement and the indirect method of 
difference are defining features of a qualitative approach in macrocomparative 
research. If so, then qualitative works make up only a small minority of 
studies—-21% of books and 5% of articles. Is qualitative research really so 
unusual? Or is Ragin’s characterization of it incorrect? 

Statistical methods of control are much more common. They are used in 
the majority of studies, although they are more than twice as popular in journal 
articles as in books.’° Tilly (1984), for one, would be dismayed by this. He 
says that “Little of long-term value to the social sciences has emerged from 
the hundreds of studies conducted during the last few decades that have run 
Statistical analyses including most of the world’s nation states” (Tilly 1984, 
p. 76). Such a claim ignores two facts. First, some research within the 
quantitative cross-national tradition has been influential and of "long-term 
value." The inverted-U shaped curve between economic output and income 
inequality (Paukert 1973, Ahluwalia 1976), the positive association of 
industrialization and liberal democracy (Lerner 1958, Lipset 1959), and work 
on identifying positions in the world system (Snyder & Kick 1979) are just 
a few examples that come to mind. Second, Tilly's charge ignores the relative 
recency of large-scale cross-national statistical analyses. Only since the late 
1950s and early 1960s have such studies begun to thrive. Other modes of 
comparative research can claim a longer history. 


"When work m development and demography is omitted from the analysis, reliance on 
statistical controls 1s even more prevalent. 

' The use of statistical controls does not necessarily mean a quantitative cross-national analysis 
based on large numbers of cases, however Studies employing statistical controls at the micro 
level—3which may or may not include statistical controls at the macro level—are also included 
under this category. 


340 BOLLEN ET AL 


Table 6 Alternative explanations considered in recent macrocomparative 
research* (N — 108) 





Books Articles Both 

Alternative explanations (96) (96) (9) ` 
Omitted variables: 

Internal 14 49 40 

External 0 6 5 

Both 64 24 32 
Reciprocal causality 21 10 13 
Nonindependence of cases 7 4 3 
None 14 18 17 


"Note Sample ıs restricted to works having explanation as a goal Columns do not 
sum to 100% because studies may consider a variety of alternative explanations 


Alternative Explanation 


No research design is perfect. It is always possible to come up with alternative 
explanations for the results of an analysis. Omitted variables, reciprocal 
causation not specified in the model, and nonindependence of cases (“diffu- 
sion") may undermine valid inference about causal effects. Researchers are 
well advised to evaluate their own work critically, identify potential flaws in 
the design, and address and hopefully defuse the criticisms that others might 
make. As part of our review, we examined the extent to which authors 
interested in explaining structure or process do this (in print). We took note 
of any discussion of possible omitted variables (distinguishing between 
internal and external factors), reciprocal causation involving dependent 
variable(s) of central interest, nonindependence of cases ("diffusion"), and 
we also recorded when none of these possibilities was considered. * Table 6 
provides the breakdown in the more restricted sample of 108 macrocompara- 
tive works oriented toward explanation. 

As shown, the large majority of studies give at least some consideration to 
alternative explanations—over four of every five. Looking more closely at 
those studies with no consideration of alternative explanations, we find two 
types. Among the books, all but one also lacks controls. We were surprised 
to find that several of the books lacking controls and not considering 
alternative explanations have been quite influential works (e.g. Wallerstein 
1984, O'Donnell 1988). These authors share commitments to broad, wide- 


I5Possible interaction effects might also be considered a type of alternative explanation. We 
did not record discussion of interaction explicitly, but ıt ıs our impression that discussion of it is 
quite rare. In other words, we do not think that the picture would change very much 1f we had 
allowed for this or any other additional category 1n our classification of alternative explanations. 
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ranging arguments and to forceful styles of presentation. Perhaps this renders 
their audience more readily responsive than would be the case in the instance 
of less ambitious works. Among the articles, the large majority (10 of 14) 
with no self-criticism of study design employ statistical controls and of them, 
at least half represent quantitative, cross-national analyses based on large 
samples of countries. 

Of those authors giving some attention to alternative explanations, the large 
majority focus on omitted variables. Authors of book-length studies are much 
more likely to address external as well as internal factors that might account 
for the pattern of results obtained." More than two thirds of such studies do 
so. We cannot tell whether this reflects anything more than differences in 
space constraints. Much less attention is given to other kinds of alternative 
explanations. Only one in ten authors considered reciprocal causality. Among 
these are South & Trent (1988) who suggest an interesting alternative 
explanation for their finding of an association between “traditional” female 
roles and adult sex ratios. One possible explanation for this finding, they 
propose, is that observed sex ratios are influenced by women’s status. In 
societies in which women have low status, they may be underenumerated in 
censuses and surveys, and this underenumeration may lead to a comparatively 
high observed sex ratio. In essence, then, low status may both cause and be 
caused by high (male biased) sex ratios. Although they dismiss this alternative, 
they remind us that the possibility of reciprocal causality between the factors 
that macrocomparative sociologists typically examine should be considered 
even if ultimately rejected. Even fewer gave any explicit consideration to 
“Galton’s Problem;” that is, the possible nonindependence of cases. As with 
the use of probability methods to select samples, most authors attentive to the 
issue of diffusion utilize predefined universes of premodern societies from 
which (geographically) stratified samples can be drawn. Gray (1985) 
illustrates the importance of considering diffusion effects in macrocomparative 
research. His replication of previous work which has identified an association 
between the status of women and the character of religious belief produces 
no support for this finding. This result, it is suggested, is attributable to the 
use of a stratified sample that controlled for historical diffusion, a control not 
present in earlier work. The consideration of reciprocal causality and 
nonindependence as alternative explanations is roughly twice as likely to occur 
in books as in articles. 

Based on this review, we see several problems with research designs in 
macrocomparative research. It is disconcerting that 30% of studies with 
explanation as their goal had no method of controlling factors that could 


When work in demography and development 1s omitted from the analysis, reliance on internal 
factors as alternative explanations is more common. 
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undermine the chosen explanatory factors. The vast majority of the studies 
did offer discussions of alternate explanations for their findings, mostly 
consisting of omitted variables. Nevertheless, alternative explanation without 
some form of control is not the most solid basis from which to draw 
conclusions about cause and effect. Consequently, otherwise impressive 
pieces such as Chirot’s (1985) argument for the importance of physical 
geography and political decentralization for “the rise of the West” relative to 
other civilizations at the outset of the modern era are weakened by their 
inattention to controls. Are geography and political structure the only factors 
that distinguished Europe from China or India? In the absence of an attempt 
at controlling for the factors that such a question might bring to mind, one is 
unable to draw strong conclusions about causation. 


MEASUREMENT 


There is an irony concerning measurement in macrocomparative research. 
Most would agree that measurement is problematic, and the belief seems well 
justified. Countries differ in their capacity for data collection and in the quality 
and coverage of the data produced. Key definitions may vary across time and 
place (e.g. “urban” may refer to places with a minimum population size 
ranging from 2500 to 20,000 and may not take account of administrative 
function or economic characteristics). Structural differences between countries 
create problems for measurement (e.g. how to compare economic output in 
market and command economies). Nonquantitative sources may be contra- 
dictory or incomplete, problems especially relevant to macrocomparative 
historical research. Typically, data for macrocomparative research come from 
secondary sources.” With secondary sources, seemingly trivial problems such 
as typographical error can be a concern." Even the few studies relying on 
primary sources have problems with the subjectivity of interpretation, 
questions about replicability, and nonrepresentativeness of sources. The irony 
is that despite general acknowledgment of its deficiencies, measurement is a 
neglected area in macrocomparative research. 

To some extent the irony is manifested in treatments of methodological 
issues and research strategies. Few authors give any attention to measurement, 
none of them recent. Przeworski & Teune (1970) devote two chapters of their 
book to issues of direct and inferred measurement in different contexts and 
establishing measurement equivalence across systems. Smelser (1976), Armer 
& Grimshaw (1973), Vallier (1971), and Elder (1976), too, give considerable 


only 14% of the books and articles reviewed here draw on primary sources, and half of 
them also consider secondary data Data from primary sources are more likely to be the focus 
of books than articles, but the difference is less than might be expected (26% versus 14%) 

?!In at least one instance, a key outlier turned out to be due to an error in a published data 
source (see London 1988a and his letter to the editor—1988b). 
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attention to description, classification, and measurement in comparative 
research. But these are the exceptions and are not recent publications. Indeed, 
Ragin (1987) sets forth a strategy for analysis that could accentuate measure- 
ment problems when he proposes that variables be dichotomized in a Boolean 
scheme of comparative analysis. Dichotomizing continuous variables adds 
catagorization error to the measurement error already present in the continuous 
variables. 

Neglect of measurement is a significant omission in current methodological 
work. Not surprisingly, the same could be said of the application of 
macrocomparative research methods as well. Measurement issues arise in 
descriptive as well as explanatory studies, so, for this reason, we include both 
types of studies here and return to our sample of the 119 books and articles 
examined in detail. The rest of this section documents the state of measurement 
in macrocomparative research with respect to the aspects summarized in Table 
7. We begin with definitions of concepts, both theoretical and operational, and 
then proceed to issues of validity, reliability, and comparability. 

Development, democracy, revolution, ethnicity, and status of women are 
just a few of many concepts in macrocomparative research with multiple 
meanings and connotations. To avoid confusion, authors need to explain how 
they are using terms that identify the major concepts of a study and, ideally, 
distinguish the dimensions of those that are multidimensional. In practice, the 
large majority (85%) of books on topics in development and demography 
books did provide explicit definitions, although a surprising minority (15%) 
did not. All but one article gave definitions; but dimensionality was generally 
not addressed. Further, among those considering dimensionality, very few 
performed specific tests.” Discussions of dimensionality, even just to say that 
a concept is unidimensional, are a sign of a more in-depth treatment of 
theoretical concepts. Tests of it can range from substantive arguments based 
on historical material to statistical tests based on the application of factor 
analysis. Although authors generally discuss the meaning of their central 
concepts, these discussions of dimensionality are limited. 

One way to anchor theoretical discussions to real-world conditions is by 
providing ways to observe concepts in action. Operationalization refers to the 
methods used to accomplish this. These methods often involve quantitative 
measurement but also may be based on qualitative accounts. Most journal 
articles provide a way to track concepts, although only half of the books do 
so.” Few authors in either medium explore alternative measures of their 


2 Although discussions of dimensionality are no more likely to appear 1n articles than books, 
the former are more likely to provide explicit tests. 

“Only 14% of the macrocomparative preces published in ASR failed to anchor their concepts 
to observable variables. 
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Table 7 Characteristics of data and measurement in recent macrocomparative research* 
(N = 119) 


Books Articles Both 


(%) (%) (%) 
Source of Data 
Primary sources 6 7 8 
Secondary sources 74 91 86 
Both 21 2 8 
Handling of Missing Data 
No discussion 71 57 61 
Cases dropped 21 35 31 
Variables dropped 6 4 4 
Variables estimated 15 9 11 
Measurement 
Concepts explicitly defined 85 99 95 
Dimensionality considered 27 21 27 
—explictly tested (when considered) 22 30 28 
Operational variables explicitly defined 50 91 80 
Altemative measures considered 29 26 27 
Multiple indicators incorporated 18 26 24 
Same variables used 1n each nation 85 89 88 
Cross-national validity of measures discussed 24 22 23 
-measures equally valid m each nation (when discussed) 75 84 76 
Reliability discussed 29 25 26 
Relability measured 6 14 12 


* Note: Sample consists of all nonedited books in development and demography reviewed m CS; all articles 
published in ASR, AJS, SF; and all works by sociologists in CSSH 


central concepts (27%), despite the advantages of doing so. How robust are 
the results? Are the same conclusions reached if major concepts are measured 
in some other way? Sensitivity to alternative sources or measures affects the 
confidence with which inferences can be made. 

The use of multiple indicators is also unusual. Multiple indicators resemble 
alternative measures in that more than one indicator operationalizes a concept. 
The difference is that multiple indicators are included simultaneously, and 
alternative measures sequentially. London & Smith’s (1988) analysis of 
urbanization and economic growth, for instance, simultaneously employs two 
measures of urban-rural inequality to operationalize their more general concept 
of “urban bias.” The use of multiple indicators can enhance the validity of 
measurement. Considering several measures simultaneously lessens the im- 
pact of the idiosyncratic behavior of single indicators. However, multiple 
indicators appear in only a small minority of the books (18%), all but one of 
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them in the area of demography. They appear in a similarly modest percentage 
of journal articles (2696). 

Conceptualization, dimensionality, alternative measures, and multiple in- 
dicators are generic issues in all sociological research. Problems of compara- 
bility, however, figure more prominently in macrocomparative research than 
other traditions. It is not always possible or desirable to use the same variables 
or qualitative sources of information for different settings. For example, 
consider the young woman who has reached her family size goal. In the United 
States, if you ask such a woman whether it would be a problem if she were 
to get pregnant, she is likely to say yes. In Costa Rica, she may say no. This 
is because in Costa Rica, such a question may be perceived as a veiled inquiry 
about the likely use of abortion rather than a measure of commitment to a 
family size goal. 

One challenge of macrocomparative research is finding or constructing 
measures equally valid in different countries. For instance, does a survey 
question in one society have the same meaning 1n another society? The example 
just given does not. Or, does a qualitative historical account do as good a job 
documenting a phenomenon in all the countries it covers? It is only natural that 
authors will be better informed about some cases than others. Can we compare 
the accounts of different historians discussing the same phenomenon in 
different countries? Clearly if the answers to these questions are no, then doubt 
is cast on comparative analyses using such information. 

Equal validity across settings is essential in macrocomparative analysis. 
One might expect to see it considered in depth in the empirical literature. This 
is not the case: less than one quarter of the books and journal articles discuss 
whether measures are equally valid in different countries of their samples. 
Sometimes there is only a passing reference to issues of equal] validity, but 
other times a more rigorous approach is evident. Loscocco & Kalleberg 
(1988), for instance, work to assure the cross-national validity of survey items 
regarding work commitment and values by employing a multiple or back- 
translation of items between the languages of the nations studied. For most 
other researchers, however, the implicit assumption seems to be that measures 
are equally valid. Perhaps authors feel that using identical measures guarantees 
equal validity. The large majority of studies do use the same measures for all 
of the countries in their analysis (88%). 

Validity and reliability are dual goals in measurement. Validity is whether 
a measure really measures what it is supposed to measure. As noted, few 
studies address measurement validity. Reliability refers to the stability or 
consistency of a measure, i.e. the extent to which remeasurement using the 
same methods would lead to the same results. Few authors in our sample of 
books and articles discuss reliability (26%), which should temper our 
confidence in the quality of measurement and in the results. When reliability 
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is addressed, it is unlikely to be assessed empirically.” Just 6% of the books 
and 1496 of the articles report the reliability of their measures. Attention to 
the reliability of measures takes a variety of forms in the literature that we 
review. In a study of economic discrimination and homicide, for instance, 
Messner (1989) assesses the reliability of the measure of the homicide rate 
employed by introducing an additional, independent estimate of the homicide 
rate and observing the differences in parameter estimates that result. 

Major measurement problems are expected in macrocomparative research, 
but if one were to judge by current practices, one might be led to a different 
conclusion. Issues surrounding measurement usually are overlooked. Defini- 
tions of major concepts are commonly provided, but the dimensions of 
concepts are neglected. In half of the books there is no explicit discussion of 
measures, Roughly three quarters of the books and articles do not consider 
alternative measures or multiple indicators of constructs, whether measures 
are equally valid in the different countries, or the reliability of measures. 
These are basic problems, as troublesome for descriptive and illustrative works 
as they are for works oriented toward explanation. Perhaps the expansion of 
international data sets, and especially surveys collected in multiple settings, 
will renew attention to issues of measurement. 


CONCLUSIONS 


As with the research we review, our study has limitations. In using CS, ASR, 
AJS, SF, and CSSH to define our sampling frame, we undoubtedly have 
omitted significant pieces of recent macrocomparative research. Without 
knowing the universe, we cannot claim to have sufficiently represented current 
practices in the field. Nevertheless, we draw on works of diverse theoretical 
and methodological orientation, representing all major approaches to macro- 
comparative sociology, and our general conclusions should be of interest. 
While some confirm shared perceptions (and suspicions) of the macro- 
comparative literature, others directly counter assumptions that we had at the 
outset of this project. 

Work in the areas of development and demography weighs heavily in the 
sample of books and articles that we submit to a detailed analysis, constituting 
over half of the research intensively reviewed while making up only a quarter 
of the macrocomparative publications appearing in our sources. While our 
review has revealed an unanticipated level of continuity and similarity in the 
literature, we were concerned that demography and development would 


25 Reliability can be assessed 1n a variety of ways. For qualitative accounts, one might focus 
on the extent of agreement between different sources For quantitatrve data, more formal measures 
of reliability are available 
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exhibit relatively distinct styles of research and that oversampling in these 
areas would bias our findings. For one, works on these topics are distinct in 
their tendency to employ large samples. But when we exclude such works 
from consideration and replicate our analysis of sampling, research design, 
and measurement, we find much consistency. Aside from a heavier reliance 
on statistical controls and on internal factors as alternative explanations in 
work in other topical areas, the results are very nearly identical. While we 
can not assess the implications of drawing books from other topic areas, this 
similarity lends confidence to our conclusions. 

In addition to limitations related to our sampling of the macrocomparative 
literature, our conclusions may be sensitive to the coding of methodological 
practice. We do not think so, however. Most of the coding involves relatively 
objective features (e.g. sample size, use of controls, explicit definitions) and, 
not surprisingly, there was a high level of agreement (29090) in the 
classification of studies according to these characteristics in some limited 
checks we performed. Further, it seems unlikely that coding error could have 
produced the patterns we describe. Most of our observations are based on 
practices in a preponderance of studies. Coding adjustments would change 
little of this commentary. 

We find commonalities in the sampling, research design, and measurement 
practices of macrocomparative research. Nearly all the studies we investigated 
had explanation as a goal. Researchers can take steps to lessen the threats 
that can undermine their explanations. We chose to see the extent to which 
these steps were taken. Eighty percent or more of the books and articles 
assume stochastic relations and consider alternative explanations for their 
results. It is rare that diffusion is included among these alternative explana- 
tions. Mill’s methods of control were infrequently employed. Probability 
sampling in selecting cases is extremely unusual, but listing the countries of 
the sample is common. Concepts are defined, and the same variables or 
sources are generally consistent throughout a study. Authors seldom test the 
dimensionality of concepts, nor is reliability empirically assessed. A smaller 
percentage, but still a majority of the books and articles have longitudinal 
components in their investigations. The majority do not discuss the dimen- 
sionality of concepts, consider alternative measures or multiple indicators, 
address the reliability of data, or discuss the cross-national validity of the 
measures that they employ. 

We found some differences between books and journals. Books (50%) were 
much less likely than journals (91%) to have operational variables defined. 
The majority of books (59%) had no method of control to strengthen the 
credibility of their explanations while 75% or more of the papers in ASR, 
AJS, and SF had methods of control. CSSH had only one article that applied 
control procedures, although the vast majority liberally suggested alternative 
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explanations. Statistical controls were by far the most common (70% or more) 
method of contro] for ASR, AJS, and SF. Books were more likely than journal 
papers to include both internal and external factors in their alternative 
explanations for their findings. About 90% of the books and CSSH included 
longitudinal data, a larger percentage than ASR, AJS, or SF. 

Our evaluation of macrocomparative research as practiced reveals several 
major methodological weaknesses. A major threat to the internal validity of 
a good number of these studies is the failure to apply any method of control. 
The explanatory factors are not checked to determine whether their putative 
influence is due to other factors. Given that explanation is the goal of much 
of macrocomparative research, we see this as a serious threat to attaining that 
goal. It appears that some works substitute a discussion of alternative 
explanations for the phenomenon for explicit controls. Though this can further 
the credibility of an hypothesis, it is not a sufficient replacement for control. 
It also bears mentioning again that diffusion is a plausible determinant of a 
nation's traits, yet it is extremely rare for macrocomparative work to 
investigate this alternative. 

External validity also requires attention. Most of the studies did not see 
their results restricted to the specific countries and time points that they 
analyzed. Yet, few discuss the generality of their results or the justification 
for any generalizations that would be made. Generalizing findings 1s compli- 
cated by the fact that probability sampling 1s hardly ever used to select cases 
for analysis. Similarity of cases and data availability are far more common 
sampling mechanisms. But what does this imply for the generality of the 
findings? Are significance tests reasonable in these cases? Macrocomparative 
research has largely overlooked these questions to date, but they have clear 
implications for the extent we can generalize findings. Fuller examination of 
this methodological issue could advance macrocomparative research. 

Measurement is also problematic in macrocomparative research. Com- 
plaints are frequent, but actions are rare. Data published by the United Nations 
or World Bank are typically treated as valid and reliable. Historical sources 
are rarely compared. Multiple indicators are seldom used. Measures are 
assumed comparable across countries. The expansion of international data 
sets, especially surveys conducted in multiple settings, will spur a renewed 
attention to these and other measurement issues. 

We began by identifying factors driving renewed interest in macrocompara- 
tive research. These social forces will continue to fuel macrocomparative 
research efforts, and the methodological issues we raised are likely to reappear 
in this work. For instance, can we truly understand the collapse of communism 
in the former Soviet Union and its satellites without considering diffusion 
processes? Will our knowledge of democratization be distorted by sampling 
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only those countries that experience increases in democracy? Are we making 
adequate assessments of world system effects on foreign debt if we fail to 
explicitly control for confounding influences? If surveys and aggregate data 
are not comparable across countries, how much of a benefit is their increased 
availability? Macrocomparative research is generating intriguing hypotheses 
and evidence on issues of central sociological importance. Problems of 
research design, sampling, and measurement will not diminish with its growth. 
Rather, the need to address them will only be felt more strongly. 
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Abstract 


Research on life transitions highlights the normative and nonnormative 
changes that individuals experience over time. During the past two decades, 
life course perspectives have provided a strategic context for studying the 
genesis of life transitions and their personal and social consequences. Both 
population-based and individual models of transitions have become more 
complex, focusing on the ways that social and historical contexts shape life 
transitions. Át the individual level, progress has also been made in identifying 
the mechanisms by which transitions affect outcomes. Research on life 
transitions continues to grapple with two major issues—the challenges raised 
by heterogeneity, and the need to better link macro and micro perspectives— 
although advances have been made in both cases. One of the most promising 
characteristics of recent studies is cross-fertilization of concepts and methods 
from previously distinct research traditions: role theory, social stress theory, 
and life course sociology. 


INTRODUCTION 


During the past two decades, sociologists have devoted increased attention to 
history, to heterogeneity, and to the dynamics of change. One research area 
in which these issues have been the dominant themes and challenges is the 
study of life transitions. Thus, the sociology of life transitions shares 
intellectual concerns with the discipline more broadly. The study of life 
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transitions has unique history and themes as well. Although recent efforts 
have been especially vigorous, the roots of research on life transitions can be 
traced to classic perspectives on social roles, the relationships between social 
location and personal well-being, and the mechanisms by which social 
contexts shape individual lives. 

This chapter examines progress in the study of life transitions. Initially, 
theoretical orientations that made important but limited contributions to the 
study of life transitions are briefly reviewed. The subsequent section depicts 
the contributions of life course perspectives to research on life transitions. 
The final section examines emerging issues and challenges in the study of life 
transitions. In line with the themes noted above, issues of history, heteroge- 
neity, and change receive particular attention throughout the chapter. 


PRECURSORS OF THE STUDY OF LIFE TRANSITIONS 


Both role theory and social stress theory predate recent interest in life 
transitions. Both theories provided insights that were encompassed by, and 
typically expanded upon, in life transition studies. In this section, role theory 
and social stress theory are briefly reviewed, with discussion restricted to 
concepts and findings that contributed to the study of life transitions. 


Role Theory 


Linton (1936) introduced the initial elements of role theory, defining status 
as a position in social structure and role as the expected behaviors of status 
occupants. Over time, use of the term "role" broadened in two ways. First, 
role is now used to describe both a status and the behaviors associated with 
it. Second, role can refer to either the behaviors expected of a status occupant 
or the behaviors exhibited by a status occupant. Role theory became more 
complex—and more directly relevant to the study of life transitions—with the 
emergence of interest in the dynamics of role allocation and socialization (e.g. 
Biddle 1979, Brim 1966). Role allocation refers to the processes by which 
roles are assigned to individuals and to the related dynamics of role entry and 
exit. Socialization refers to the processes by which social structure transmits 
to individuals the skills and attitudes compatible with the roles that they enact. 
Role theorists view social norms as the cultural referents that permit role 
allocation and socialization to occur in a routinized and predictable manner. 
Norms are the basis of shared expectations about role behavior, the allocation 
of roles to individuals or subgroups, the timing of role entry and exit, 
socialization experiences that facilitate role performance, and social sanctions 
that are applied when individuals fail to perform their roles effectively (e.g. 
Brim 1966, Clausen 1968). 

The link between role theory and life transitions is straightforward: role 
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entry and exit are, by definition, transitions (Allen & van de Vliert 1984, 
George 1980). Because of this link, role theory makes two major contributions 
to the study of transitions. First, role theory offers a potential explanation for 
the genesis and timing of life transitions. Most transitions are normatively 
governed and, hence, predictable in both occurrence and timing. Second, 
when available, socialization provides individuals with the skills needed to 
master transitions and perform new roles effectively. These principles lead to 
the prediction that role-based transitions typically operate smoothly, with little 
disruption to individual lives or to social structure. 

With a few exceptions, role theory has been relatively dormant during the 
past three decades. The concepts of role and socialization are sometimes used 
to interpret empirical findings, but direct tests of role theory are rare. 
Ultimately, the breadth and generality of role theory may be both its major 
attraction and its greatest flaw. Role theory is attractive as a heuristic or 
general metaphor for thinking about links between social structure and 
individual behavior. Because role theory is so general, however, it is difficult 
to specify results that would refute it. 

Other problems with role theory are relevant to the study of transitions. 
Although role allocation and socialization are, by definition, processes, role 
theorists have paid little attention to time per se. The major acknowledgment 
of time in role theory is the distinction between role entries and exits that 
occur “on-time” or “off-time,” that 1s, unexpectedly (e.g. Hagestad & 
Neugarten 1985), with the latter more likely to disrupt personal lives and 
social structure. Role theory also pays insufficient attention to heterogeneity, 
primarily because social context is largely ignored. A view of individuals 
embedded in heterogeneous social contexts that directly affect role allocation, 
socialization, and role performance has not been part of traditional perspec- 
tives on social roles (Granovetter 1985). 

One recent work, relevant to the study of transitions, illustrates both the 
strengths and weaknesses of role theory. Ebaugh (1988) studied role exits, 
correctly noting that they have received less attention than role entry. Based 
on interviews with persons who had exited one of nine roles, ranging from 
ex-nuns to transsexuals, Ebaugh concluded that the process of “becoming an 
ex” involves four stages: experiencing first doubts; searching for alternative 
roles; occurrence of a turning point (i.e. a specific event that triggers role 
exit); and creation of a new identity as former role occupant. Ebaugh also 
identified characteristics that shape the role exit. For example, role exits are 
easier to negotiate if they are potentially reversible and are voluntary. 

Ebaugh’s work advances role theory in a number of ways. First, Ebaugh 
views role exit as a process. Second, this is the first study that attempts to 
identify a model of role exits that generalizes across roles. Third, identification 
of factors that shape the nature and outcome of role exits demonstrates greater 


356 GEORGE 


sensitivity to heterogeneity and social context than most previous work. But 
this study also illustrates some of the limitations of role theory. Most 
importantly, both timing and heterogeneity are inadequately examined. 
Because her sample consisted only of persons who had completed role exits, 
information was unavailable about unsuccessful attempts to relinquish roles. 
There also was no attempt to link the role exit process to social location; in 
_ particular, the genesis of role exits was unexamined. 

Ultimately, role theory provides only a very abstract view of life transi- 
tions—a view that fails to adequately consider timing, heterogeneity, and 
social context. Role theory helped to shape studies of life transitions. More 
importantly, part of the distinctive nature of research on life transitions is a 
result of attempts to overcome the limitations of role theory. 


Social Stress Research 


The major impetus to development of stress research was the desire to better 
understand the relationships between social location and illness. Demogra- 
phers and epidemiologists had long documented that social location—espe- 
cially sex, race, and social class—are robustly related to the risk of physical 
and mental illness. During the 1960s, scholars began to investigate stress as 
a potential mechanism for explaining those relationships (e.g. Dohrenwend 
& Dohrenwend 1969, Langner & Michael 1963). 

Initial conceptualizations of the link between stress and illness were simple 
and based on a homeostatic view of human lives. Stressful events were 
hypothesized to mediate the effects of social location on illness. Thus, women, 
older adults, members of ethnic and racial minorities, and persons who were 
economically disadvantaged were expected to experience higher levels of 
stress which would, in turn, increase their risk for physical and mental illness 
(e.g. Kessler 1979, Srole et al. 1962). Stress was typically operationalized 
as the number of life events experienced during a specified time period (e.g. 
Holmes & Rahe 1967). Most life events included in standardized scales are 
transitions, which is the direct link between stress research and the study of 
life transitions (George 1980). Early tests of this simple model yielded mixed 
results. In general, life events were significant predictors of illness onset, but 
the magnitudes of these relationships were modest (see Rabkin & Struening 
1976 for a review of that early research). 

Over time, a more complex view of stress and its consequences evolved. 
The most important element of this evolution was a change in the formulation 
of research questions concerning the links between stress and ilIness. In early 
studies, the primary research question was whether stress mediates the 
relationships between social location and illness. Later studies examined the 
conditions under which stress leads to adverse outcomes. Thus, social stress 
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theory was transformed from a mechanistic view to a richer, more complex 
perspective that emphasizes contingencies and heterogeneity. 

Two other elements of stress research are relevant to the study of transitions. 
First, as stress theory evolved, investigators became concerned about the 
validity of summing the number of life events experienced in a defined time 
period and assuming that this indexed amount of stress. Some scholars also 
suggested that summary scales impede identification of the unique antecedents 
and consequences of specific life events. Although life event scales are still 
used, two other approaches have become increasingly common. One approach 
is to study single events (e.g. divorce, retirement) (e.g. Dohrenwend et al 
1987). Other investigators suggest an intermediate approach in which theo- 
retically relevant subsets of events are aggregated (e.g. family events, exit 
events) (e.g. Thoits 1987). The appeal of this approach is that events need 
not be examined individually, but the level of aggregation also will not mask 
distinctive relationships between events and their consequences. 

Second, stress research now focuses on identification of the factors that 
mediate and moderate the impact of life events on well-being (e.g. Lin 
1992). Life events have different effects, depending on the conditions under 
which they occur and the resources available to individuals experiencing 
them. A large literature suggests that multiple factors affect the stress 
process. Examples of such factors include personality, economic resources, 
coping efforts, and social relationships, especially receipt of emotional and 
instrumental support from significant others. Space limitations preclude 
review of these factors (but see Pearlin et al 1981 and Pearlin 1989 for 
theoretical models underpinning recent stress research and George 1989 for 
a review of studies). 

Stress research has made two primary contributions to research on life 
transitions. The most important of these is recognition of heterogeneity among 
events, consequences of events, and the contingencies that shape the impact 
of life events. In addition, stress research demonstrates that life events (and, 
therefore, life transitions) are important for health and well-being. Physical 
and mental health are not the only important consequences of transitions, but 
stress research does the most effective job to date of demonstrating that the 
effects of transitions are of practical importance. 

The narrowness of much stress research poses obstacles to an understanding 
of life transitions, however. Three problems are especially relevant. First, and 
most important, stress research pays insufficient attention to social structure, 
macrolevel social processes, and social contexts that are related to the 
occurrence and consequences of life events (Jacobson 1989, Pearlin 1989). 
Unlike role theory, for example, stress research largely ignores the fact that 
transitions tend to be normatively governed and are, therefore, often predict- 
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able and socially desirable. Emphasis on individuals, extracted from social 
structure, continues to dominate this field. 

Second, reflecting the applied and epidemiologic foundations of stress 
research, many studies of life events focus on prediction rather than 
explanation, although this is less true of recent studies. Stress research too 
often emphasizes the prediction of illness rather than explanation of the 
processes by which life events affect well-being. Finally, although stress 
research focuses on process, timing per se has been largely ignored. Little 
attention is paid to the timing, as compared to the occurrence, of life events 
and to the expected lag between the occurrence of an event and its 
consequences. 


TRANSITIONS IN LIFE COURSE PERSPECTIVE 


A distinctive sociology of the life course has emerged in the past two decades. 
Although there is no unified theory of the life course, life course perspectives 
share common principles, three of which are especially relevant to the study 
of life transitions. First, the life course is a social phenomenon, distinct from 
the life span (e.g. Hagestad & Neugarten 1985). Life span refers to duration 
of life; life-span characteristics are closely related to age and largely invariant 
across time and place. In contrast, the life course reflects the intersection of 
social and historical factors with personal biography (Elder 1985a). Hence, 
life course patterns are expected to vary across time, space, and populations. 

Second, life course perspectives focus on “age-differentiated, socially 
recognized sequences of transitions” (Rossi 1980). Transitions and trajectories 
are key concepts in life course research (Elder 1985a, Hagestad 1990). 
Transitions refer to changes in status that are discrete and bounded in duration, 
although their consequences may be long-term. Trajectories are long-term 
patterns of stability and change, often including multiple transitions, that can 
be reliably differentiated from alternate patterns. Transitions and trajectories 
are interrelated. As Elder notes, “transitions are always embedded in 
trajectories that give them distinctive form and meaning” (1985a). 

Third, and as implied by previous principles, life course studies require a 
dynamic, longitudinal perspective. Unfortunately, longitudinal data spanning 
long periods of time are scarce. Consequently, most life course research 
examines transitions rather than trajectories, although there are important 
exceptions as noted below. 

Life course studies of transitions can be conveniently divided into two 
subsets, based on unit of analysis: population-based studies and studies of 
individuals. Each tradition will be reviewed in terms of the research issues 
examined, major findings, and implications for understanding transitions. 
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Population-Based Studies of Transitions 


Population-based studies have examined three major issues: the timing of 
transitions, sequences of transitions, and transitions as life course markers. 


TIMING OF TRANSITIONS In population studies, the timing of life course 
transitions is usually operationalized as either the average or median age at 
which a transition occurs or the age by which a majority of cohort members 
have experienced a transition. Variability of transition timing is typically 
examined both across and within cohorts. 

During this century, the average age at which several transitions occur has 
changed. Compared to earlier cohorts, recent cohorts of men are older, on 
average, at school completion and age of first full-time job. For both 
transitions, the median age increased by two years for men born in 1948, as 
compared to those born in 1907 (Hogan 1981). The timing of marriage also 
has changed. From 1900 til 1930, median age of first marriage declined nearly 
three years for men (from 26.0 to 23.3). After that, median age at marriage 
declined more slowly, reaching a steady-state of about 21 years for men born 
between 1933 and 1951 (Hogan 1981). More recently, median age at first 
marriage has increased (Watkins et al. 1987). Intracohort variability in these 
transitions is significant. Persons of lower social class, African-Americans 
and Hispanics, and rural residents complete these transitions earlier than their 
peers (Hogan 1981). In general, women experience these transitions earlier 
than men, but cross-cohort patterns parallel those of men (Hogan 1985). 

The timing of transitions in middle and later life also changed during the 
twentieth century. The average age at which individuals experience death of 
the last parent has increased dramatically (Gee 1987, Uhlenberg 1980, 
Winsborough 1980). Over the same period, average retirement age has 
dropped steadily, despite increases in life expectancy that suggest the potential 
for extending employment to later ages (Tuma & Sandefur 1988). Widowhood 
has become more strongly linked to both sex and age since the late 1800s, 
occurring later and with women at much greater risk (Martin Matthews 1987). 
Intracohort variability in the timing of these transitions is substantial. Persons 
of Jower socioeconomic status and members of disadvantaged racial and ethnic 
minorities experience parental death, retirement, and widowhood at younger 
ages, on average, than do their peers. 


SEQUENCES OF LIFE TRANSITIONS Investigators also have examined se- 
quences of life transitions, their compression or dispersion over time, and 
their consequences. Initial hypotheses posited that transition sequences are 
governed by norms specifying the appropriate ordering of transitions and that 
deviance from those norms would result in negative outcomes. 
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Most research on transition sequences focuses on entry into adulthood. 
Modell et al (1976) performed the pioneering work; Hogan (1978, 1981) 
examined the issue most comprehensively. The normative sequence for three 
key transitions of early adulthood consists of leaving school, followed by first 
full-time job, followed by first marriage. Using data from American men born 
between 1907 and 1952, Hogan found/that disorderliness is substantially more 
prevalent among recent than earlier/cohorts. Recent cohorts also completed 
the three transitions in a shorter period of time than earlier cohorts, suggesting 
that the transition to early adulthood has been compressed. Similar findings 
are reported by other investigators, including those using data from other 
western societies (e.g. Featherman & Sorensen 1983, Kerckhoff 1990). The 
youngest men in Hogan’s study were born in 1952. Thus, recent trends toward 
later marriage were not represented in his data. 

Hogan found limited evidence that disorderly transition sequences result in 
poorer outcomes. Men who experienced disorderly sequences earned less 
money, had less prestigious jobs, and were more likely to divorce than men 
with orderly sequences. The magnitudes of these deprivations were modest, 
however. In addition, for some men, the costs of disorderliness were 
compensated for by higher socioeconomic achievements. The nature of the 
variability within younger cohorts operated against Hogan's hypothesis: 
orderly sequences were most prevalent among men from lower socioeconomic 
and rural backgrounds. Both groups also exhibited relatively low levels of 
educational and occupational attainment. 

Data permitting cross-cohort comparisons of the sequencing of young adult 
transitions among women are less common. Comparisons of recent cohorts, 
however, suggest that disorderly sequences are more prevalent among women 
than men (e.g. Hogan 1985, Kerckhoff 1990). 


LIFE COURSE MARKERS Transitions that are both highly prevalent and highly 
predictable arc viewed as life course markers (Winsborough 1980). During 
the 20th century some transitions became more predictable; others became 
less so. Transitions that are now less predictable are concentrated in early 
adulthood. The timing of marriage and parenthood exhibit greater variability 
among younger than older cohorts (Cherlin 19882, Modell 1980). These 
transitions also are less predictable now because larger proportions of recent 
cohorts do not marry and/or do not have children (Watkins et a] 1987). Another 
less common and less predictable transition during early adulthood 1s death 
of a minor child. Uhlenberg (1980) estimates that, early in this century, parents 
had a 6296 chance of losing at least one minor child; in 1980, the probability 
had dropped to 4%, no longer representing a life course marker. 

In contrast, transitions during middle and old age have become more 
prevalent and predictable. Death of a parent during middle age is now so 
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predictable that Winsborough (1980) suggests that it is a publicly recognized 
life course marker. As noted previously, voluntary retirement has become a 
predictable, nearly universal transition. Martin Matthews (1987) demonstrated 
that the timing of widowhood has become more homogeneous during the past 
century. She suggests that this increased predictability helps recent cohorts 
of older women to better prepare for widowhood and provides natural support 
groups of widowed age peers. 

Population studies provide important information about the evolution of the 
life course across cohorts and about variability within cohorts. One limitation 
of these studies is the relative lack of attention to mechanisms of change 
(Hogan & Astone 1986). Social change is often interpreted as the driving 
force for cohort differences in transitions. For example, some scholars suggest 
that, in the United States, historically unprecedented affluence experienced 
during the middle of this century accounts for declines in age at first marriage. 
Other observers argue that life course markers during middle and late life 
were unlikely to emerge until life expectancy increased to the point that most 
cohort members lived til advanced ages. These are credible post hoc 
inferences; but they are not empirical tests of hypothesized explanatory 
mechanisms. To date, population-based studies of life transitions have been 
more successful in describing changes in transition patterns than in explaining 
them. 


Individual-Based Studies of Transitions 


Individual-based studies examine the effects of transitions at one point in time 
on subsequent life course outcomes. The emphasis of these studies is 
explication of the processes by which early transitions exert enduring 
influences on later life patterns. Studies in this tradition typically are based 
on longitudinal data; a few investigators have used prospective data covering 
large segments of the life span. 


LINKING HISTORICAL EVENTS, LIFE TRANSITIONS, AND OUTCOMES Elder’s 
pioneering studies (e.g. 1974, 1979) of the effects of the Great Depression 
on subsequent life patterns and achievements became a catalyst for much of 
the research in this tradition. Elder examined two cohorts of children: (i) 
children born in the early 1920s, who were adolescents during the Depression 
and (ii) children born in 1928-29, who were young children during the 
Depression. Within-cohort variability was examined in terms of (i) social class 
prior to the Depression and (ii) the amount of economic deprivation triggered 
by the Depression. In brief, substantial differences were observed across 
cohorts, within cohorts, and between girls and boys. The younger cohort was 
more adversely affected by the Depression than the older one. Within cohorts, 
the strongest effects were found among those whose families suffered the 
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greatest economic losses, children in working- rather than middle-class 
families, and boys rather than girls. Effects of the Depression persisted in 
adulthood. Although socioeconomic achievements did not differ significantly, 
several life domains—including sex roles within the family, patterns of work 
and leisure, and political values—exhibited differences across and within 
cohorts, depending on amount of deprivation. 

The impact of World War II on subsequent life patterns also has been 
studied. Elder et al (1991) examined the impact of World War II military 
service on midlife career achievements among two cohorts of US men. The 
older cohort, who entered adulthood during the Depression and did military 
service later, had lower midlife career achievements than did the younger 
cohort. Military service was unrelated to occupational achievements in the 
younger cohort but was advantageous in the older cohort. 

The effects of World War II participation on subsequent life course patterns 
have been examined in other countries as well. Mayer (1988) compared 
German cohorts born between 1900 and 1936 on adult socioeconomic 
achievements. Men born between 1914 and 1925 were most intensely involved 
in the war effort; more than 75% of the men in those cohorts served in the 
military. Surprisingly, the most severe short- (1945-1948) and long-term 
(1965-1971) negative effects were experienced by men born between 1926 
and 1930, most of whom did not serve in the military or who served only 
briefly. These findings are the opposite of those reported by Elder et al (1991), 
who found older cohorts to fare less well than younger cohorts, and probably 
reflect Germany’s disastrous economic and social conditions in the years 
immediately following the war. Cook (1983) studied post-war career lines of 
men who served in Japan's military elite during World War II. Before and 
during the war, they were among Japan's most honored citizens; after the 
war, their occupations were terminated and the post-war labor market was 
closed to them. Older members of the military elite fared worst in both the 
short- and long-term; although intelligent and well-educated, many had to 
settle for farming or menial labor. Younger members fared better; they were 
permitted to enter educational institutions, and most pursued professional 
careers. 

In terms of more recent history, McAdam (1989) examined the short- and 
long-term consequences of activism during the Civil Rights Movement. In a 
design superior to previous studies, he compared subsequent life course 
experiences of applicants to the 1964 Mississippi Freedom Summer Project, 
some of whom participated and some of whom were accepted for participation, 
but withdrew. Twenty years later, participants differed significantly from 
nonparticipants. The former were more likely to endorse leftist political 
attitudes, more active in politics, disproportionately employed in "helping" 
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professions, more likely to label themselves as “liberal” or “radical,” much 
less likely to be married, and had lower incomes than did the no-shows. 
McAdam views political activism in early adulthood as a “watershed” 
experience that has enduring impact on life course patterns. He also suggests 
that social bonds developed during political activism form a structural tie that 
sustains activist attitudes and behaviors over time. Weil’s research (1987) also 
suggests that political activism during early adulthood has persisting effects. 
In Germany, cohorts who were adolescents and young adults during the Third 
Reich, especially those who participated in Nazi youth groups, endorse less 
democratic values throughout adulthood than did earlier and later cohorts. 


LIFE COURSE EFFECTS OFEARLY EVENTS Other individual-based studies focus 
on the conditions under which events or transitions experienced earlier in life 
affect subsequent life course patterns. The emphasis here is not on historical 
events, but on personal transitions and choices. A broad body of research 
falls under this rubric; studies reviewed here are, of necessity, selective. 

Childhood traumas (e.g. loss of a parent, physical abuse) have been linked 
to subsequent life course patterns. In a classic study, Brown & Harris (1978) 
traced the impact of parental loss during childhood on adult outcomes. Using 
data from a sample of working class women in London, they found that 
parental loss during childhood was associated with increased risk of clinical 
depression, lower socioeconomic achievements, and poorer quality marriages 
during adulthood. Additional analyses suggested that the effects of parental 
loss were mediated by the quality of care the child received after parental 
loss: fewer negative outcomes were observed among women who received 
high-quality care as children from parental substitutes. Although Brown & 
Harris found both parental death and divorce to increase the risk of negative 
adult outcomes, later US studies report that parental divorce has stronger and 
more negative effects than parental death (e.g. Landerman et al 1991, Tweed 
et al 1989). These findings suggest that family dysfunction, rather than 
parental loss per se, may explain the persisting effects of parental loss on 
adult outcomes. Landerman et al (1991) also found that the effects of 
contemporaneous life events on mental health during adulthood are stronger 
for persons who experienced childhood traumas. 

More normative childhood transitions also affect subsequent life course 
patterns. Alexander & Entwisle (1988) found that the transition to school 
(entering the first grade) significantly affects subsequent achievement trajec- 
tories. This transition is more difficult for blacks than for whites, despite 
similar levels of cognitive achievement at school entry. These data strongly 
suggest that the transition to school has both immediate and long-term 
consequences and that racial differences in achievement emerge quickly. 
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Another illustration of the persisting effects of childhood events is the 
relationship between age of onset of mental illness and subsequent life 
course patterns. In the most rigorous study to date, Turnbull et al (1990) 
compared adults who experienced early onset of mental illness (i.e. before 
age 20), late onset, and no mental illness on socioeconomic and family 
outcomes. Analyses were performed for four types of mental illnesses. 
Overall, early onset had signficantly more adverse effects than did late 
onset. For all illnesses but depression, early onset was associated with lower 
educational achievement, lower occupational prestige, lower income, and 
increased unemployment. Similar patterns were observed for adult family 
roles: for all illnesses except depression, early onset was associated with 

‘lower rates of ever marrying and, among the ever married, higher rates of 
divorce. Early onset was associated with lower fertility, however, only 
among those with depression. 

A final illustration of research in this tradition concerns the effects of early 
events and choices on retirement income. As is true for the employed, retired 
men have substantially higher incomes, on average, than do retired women. 
This income differential reflects several differences in the work histories of 
men and women. Compared to women, men typically have longer work 
histories, higher annual and lifetime earnings, and they are more likely to 
work in markets that provide private pensions (DeViney & O’Rand 1988). 
Examination of the joint trajectories of women’s work and family roles helps 
to explicate the processes that generate these sex differences (e.g. O’Rand & 
Henretta 1982, O'Rand & Landerman 1984). Because of family roles, 
especially parenting, women are less likely to work full-time and continuously 
throughout adulthood, decreasing their annual and lifetime earnings. For 
women covered by private pensions, truncated and discontinuous work 
histories result in lower pension income. Women also are substantially less 
likely than men to be employed in markets that provide private pensions. 
Although family transitions and trajectories partially explain sex differences 
in retirement income, they do not account for the social allocation processes 
that result in women disproportionately working without pension coverage 
(O’Rand 1988). 

The individual-based studies reviewed here are highly selective, but they 
document that transitions at one point in time often have long-term 
consequences. As these studies illustrate, individual-based studies have made 
progress in three areas that are generally neglected in population-based 
studies: explication of the mechanisms by which transitions affect outcomes, 
examination of transitions in the context of long-term trajectories, and 
consideration of the joint effects of transitions in multiple domains (e.g. 
family and work). 
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EMERGING THEMES 


Studying transitions in the life course context has been substantially more 
profitable than earlier research based on role theory and social stress theory. 
The link between transitions and life course patterns has been profitable for 
life course theory as well. Two decades ago, Clausen (1972) predicted that 
studying life transitions offered the best chance of generating a theory of the 
life course. Subsequent research supports that prediction. Knowledge of 
transitions is richer as a result of studying the life course; conversely, 
knowledge of the life course has been advanced by studying transitions. 

Research on transitions in life course perspective also has weaknesses. One 
limitation, previously addressed, concerns mechanisms of change. Two other 
problems concern the challenges of heterogeneity and the struggle to integrate 
findings from micro and macrolevel studies. This final section addresses those 
two problems and the increasing cross-fertilization of concepts and methods 
among formerly distinct research traditions. 


Challenges of Heterogeneity 


Heterogeneity is relevant to the origins of transitions, the varying outcomes 
of transitions, and the factors that mediate and buffer the effects of transitions. 
Most research on transitions focuses on modal transition patterns, underplay- 
ing the fact that large proportions of individuals do not fit those patterns (e.g. 
Dannefer 1988). A study by Rindfuss et al (1987) illustrates this point. Using 
data from the National Longitudinal Survey of the High School Class of 1972, 
these authors coded participants’ role sequences for 8 years following high 
school graduation, examining five roles: work, education, homemaking, 
military, and other. They report that 1100 sequences were required to describe 
the experiences of the 6700 men in the sample; 1800 sequences were needed 
to capture the patterns of the 7000 women in the sample. Even the simple, 
two-event sequence of education followed by work applied to only: slightly 
more than half the men and to less than half the women. This level of 
heterogeneity challenges the assumption that transitions are patterned in 
predictable ways, and it complicates research on the consequences of 
transitions. 

Heterogeneity may, in fact, challenge the assumption that there is an 
"institutionalized life course" (Meyer 1986). Kohli et al (1991) describe 
patterns of early retirement using data from seven industrialized nations, 
focusing on the mix of public and private institutional arrangements that shape 
the retirement process. Although there are multiple pathways to retirement 
across and within societies, a common element of these pathways is the demise 
of chronological age as a basis of exit from the labor force. Guillemard & 
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van Gunsteren (1991) suggest that this represents a “deinstitutionalization” of 
the life course in that the major milestone that defines entry to old age is no 
longer standardized or predictable. 

Heterogeneity also is being addressed by increased attention to the ways 
that pre-transition characteristics affect both the occurrence of transitions and 
their outcomes. Caspi et al (1987, 1988) demonstrate that childhood person- 
ality traits affect the timing and outcomes of later transitions. In their studies, 
shy children experienced the transition to adulthood later than did their more 
outgoing peers, and they also had more adverse outcomes. Children with 
explosive interaction styles also experienced adverse outcomes during the 
transition to adulthood. Hagan (1991) studied the socioeconomic achieve- 
ments of two subgroups of adolescents weakly linked to parental and 
educational control: a subculture of deviance and a party subculture. Prefer- 
ence for a delinquent subculture negatively affected early adult status 
attainment. Identification with a party subculture had weaker effects. Both 
subcultures had stronger effects for working class men. Research that focuses 
on the ways that pre-transition statuses affect transition outcomes is similar 
to the recent advances in stress research, noted previously. In both cases, 
attention to heterogeneity has led to a richer understanding of social dynamics. 


Linking Macro and Micro Findings 


Current research also does not adequately link micro and macro evidence 
about the genesis and consequences of transitions, a common problem in 
sociology more broadly. Indeed, Ritzer (1989) suggests that “the issue of 
micro-macro linkage has emerged as the central problematic of sociological 
theory in the 1980s” (italics in the original). This continues to be true in the 
1990s. Accumulated evidence, however, suggests that primary groups, 
especially family and occupational environments, are the major contexts 
within which broad social patterns influence transitions and their outcomes 
(Hagestad 1990). 


FAMILY Changes in family structure appear both to result from macrosocial 
changes and to alter the timing and consequences of transitions (George & 
Gold 1991). Kertzer & Hogan (1988) studied the impact of economic change 
on life course patterns in a nineteenth century Italian community. Their data 
suggest that changes in family structure were the initial responses to changing 
economic conditions, followed by observable effects on the life transitions of 
family members. During the past 30 years, in the United States, timing of 
the transition to parenthood has become less predictable, generating distinctive 
patterns of family structure based on age differences between generations. 
Bengtson et al (1990) suggest that these family structures lie along a continuum 
anchored at one end by “age-condensed” families, in which age differences 
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between generations are small, and “age-gapped” families, in which age 
differences between generations are large. Age-condensed families are gen- 
erated by early fertility; age-gapped families result from delayed fertility. 
Furstenberg et al (1987) traced the life course outcomes associated with early 
fertility. Burton and colleagues (Burton & Bengtson 1985, Hagastad & Burton 
1986) trace the consequences of teenage parenthood on the two older 
generations in the family. The impact of divorce on intergenerational family 
roles also has been examined (Hofferth 1985, McLanahan & Bumpass 1988). 

Increased life expectancy has altered the length of time spent in various 
family roles. For example, using simulated cohort data from census records, 
Gee (1986) compared time spent in specific family roles by Canadian women 
in the 1830s and 1950s. She estimated that during the 1830s, women spent 
fully 90% of their married years rearing minor children. By the 1950s, this 
had dropped to 40%. Patterns for men may be even more dramatic. Because 
of declines in fertility and high divorce rates in recent cohorts, American men 
have experienced a precipitous decrease in the amount of time that they live 
with children. Eggebeen & Uhlenberg (1985) report that, between 1960 and 
1980, white men in the United States experienced a 43% decrease in the time 
spent living with young children, from 12.3 to 7.0 years. During the same 
interval, American black men experienced a 23% reduction, from 15.1 to 
11.6 years. These authors suggest that changes in the duration of family roles 
have important social consequences. Gee speculates that the decline in time 
spent in active parenting has decreased the salience of parenthood and 
increased the importance of marriage during adulthood. Eggebeen & 
Uhlenberg are concerned that the brief time that many men spend living with 
children may erode their commitments to their own children and to investments 
in future generations more broadly (also see Uhlenberg & Eggebeen 1986). 
These are credible interpretations about the impact of changes in duration in 
family roles, but nevertheless they remain speculative. And some authors 
offer alternate explanations (see, for example, Furstenberg & Condran 1988 
for a reconsideration of the Eggebeen & Uhlenberg position). 


OCCUPATIONS Occupations are a strategic context for studying life transi- 
tions. Job shifts (vertical and lateral moves, as well as movement in and out 
of the labor force) are transitions, and careers are the trajectories within which 
job shifts occur. In the past decade, sociological interest in job shifts has 
increased (for a recent review of this research base, see Rosenfeld 1992). 
Much of the recent research has taken advantage of event history methods to 
examine the degree to which job shifts are duration dependent (a concept 
more broad]y useful in life course research as well, see Featherman & Lerner 
1985). The general expectation is that duration dependence is negative, i.e., 
that voluntary job shifts decline over time. Most research results support this 
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expectation, although there are inconsistent findings. One factor creating both 
estimation and interpretation problems in understanding the dynamics of job 
shifts is the typical confounding of duration, age, and seniority. Job shifts 
and careers would seem to have obvious links with life course perspectives. 
As yet, however, job mobility research has paid little attention to life course 
concepts. Rosenfeld suggests that infusion of life course concepts into job 
mobility research would help (i) to disentangle age and duration effects in job 
shifts, (ii) to focus increased attention on the transitions that anchor the 
beginnings and ends of careers (job entry and retirement), and (iii) to overcome 
the relative neglect of the ways in which other life domains and the life courses 
of family members shape job shifts and careers. Indeed, intersections of work 
and family roles have received considerable attention in previous life course 
research. The joint effects of work and family roles on retirement income of 
women were reviewed previously. The interacting effects of work and family 
roles also have been examined earlier in adulthood (e.g. Bielby & Bielby 
1989, Wheaton 19902). 

Research on job shifts and careers also offers evidence about the ways in 
which broader social patterns affect individual transitions. Substantial atten- 
tion has been paid to the ways in which job shifts are a function of opportunity 
structures in specific firms or internal markets (see Althauser 1989 for a 
review). These intermediate structures, in turn, are responsive to broader 
changes at the organizational, national, and political levels (e.g. DiPrete & 
Krecker 1991, Sorenson & Tuma 1981). 

The study of job transitions served as the basis for one attempt to develop 
a generic model of transition behavior. Nicholson (1984, 1990) used data on 
promotions and job shifts to develop the "transition cycle" model, which he 
claims can be applied to all transitions. The transition cycle includes four 
steps: preparation, encounter (occurrence), adjustment (short-term responses), 
and stabilization (long-term accommodation). The model is highly abstract, 
with few clues as to how it could be applied empirically. Nicholson's model 
is similar to Ebaugh's model of role exits, reviewed earlier. Both authors wish 
to delineate a single model that will apply to an entire class of transitions. 
Unfortunately, I fear that they are doomed to fail in this quest. The vast 
majority of research suggests that transitions are too heterogeneous and too 
dependent upon social context to be captured by a single, generic model. 
Nonetheless, the occupational arena provides a strategic context for the study 
of life transitions—and the integration of macro and micro findings has been 
more successful here than in other social contexts. 


Cross-Fertilization 


Perhaps the most important emerging theme in research on transitions is 
cross-fertilization among the previously distinct traditions of role theory, 
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social stress, and life course sociology. The boundaries among these research 
traditions are increasingly and appropriately indistinct. Pearlin (1983) was one 
of the first to articulate the utility of cross-fertilization across these research 
domains. He prescribed examination of life events and transitions in the 
context of social roles, noting that this would help to remedy neglect of the 
"structured and durable social and economic antecedents of stress." He 
expanded this theme in a more recent essay (Pearlin 1989). Similarly, Cowan 
(1991) notes that an adequate model of transitions must (i) apply to both 
normative and nonnormative transitions, (11) examine both roles as contexts 
within which transitions emerge and role reorganization as an important 
outcome of transitions, and (iii) apply across the life course, thus calling for 
integration of themes from all three research traditions. 

Empirical studies that cross the boundaries of life course, stress, and role 
theories illustrate the payoff of cross-fertilization. Many of the studies 
described earlier, especially those that document the persisting effects of 
childhood deprivations on adult achievements and outcomes, incorporated 
elements of multiple perspectives. Other examples are more explicit, in that 
the authors announce their intentions to cross paradigms. Combining elements 
of role and stress theory, Wheaton (1990b) demonstrated that role histories 
moderate the effects of transitions on mental health. Another of Wheaton's 
studies links life course and stress perspectives (Wheaton & Roszell 1992). 
Results of that study indicated that the effects of current stress on mental 
health were conditioned by the level of cumulative lifetime stress. Both very 
high and very low levels of cumulative stress were associated with increased 
risk of mental health problems in the face of current stress, moreso than were 
moderate levels of cumulative stress. Two recent edited volumes offer multiple 
illustrations of research that bridges at least two of the research traditions. 
Eckenrode & Gore's edited volume (1990) examines the links between stress 
and family and work roles, although life course issues are not explicitly 
considered. Cowan & Hetherington’s book (1991) focuses on family transi- 
tions, and all eight empirical papers rely in part on role theory and life course 
perspectives. Several also focus explicitly on stress. Fortunately, research 
based on cross-fertilization of these research traditions seems to be gathering 
momentum. 


CONCLUSION 


Research on life transitions has become a growth industry in sociology. The 
emergence of life course sociology during the past two decades provided the 
most fertile field in which to examine the dynamics, heterogeneity, genesis, 
and outcomes of life transitions. Role theory and social stress theories made 
important but limited contributions to the study of life transitions. The most 
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promising current work, however, indicates that integrating life course 
perspectives, social stress theory, and role theory yields the richest under- 
standing of life transitions. The study of life transitions also parallels 
developments in sociology more broadly. Issues that challenge both the study 
of life transitions and the sociological enterprise as a whole include depicting 
and explaining the dynamics of change, revealing and accounting for 
heterogeneity, identification of the mechanisms by which social contexts shape 
human lives, and integration of macro and micro patterns. It seems reasonable 
to conclude that, within the field of life transitions, progress is being made 
on all these fronts. 


ACKNOWLEDGMENTS 


Preparation of this chapter was supported in part by a grant from the National 
Institute on Aging (AGOO00371). I am grateful to Glen Elder for very useful 


comments on earlier drafts of this chapter. 


Literature Cited 


Alexander, K L., Entwisle, D.R. 1988. 
Achievement ın the first 2 years of school 
Pattems and process Monogr Soc Res 
Child Dev 53:1—-140 

Allen, V L., van de Vliert, E. 1984. Role 
Transitions Explorations and Explanations. 
New York Plenum 371 pp. 

Althauser, R P. 1989 Internal labor markets 
Annu. Rev. Sociol. 15 143-6! 

Bengtson, V , Rosenthal, C , Burton, L 1990. 
Families and aging’ Diversity and heteroge- 
neity See Binstock & George 1990, pp 
263-87 

Biddle, B. 1979 Role Theory Expectations, 
Identities, and Behaviors New York: Aca- 
demic. 416 pp. 

Bielby, W T., Bielby, D. D 1989. Family 
tes Balancing commitments to work and 
family in dual earner households Am. 
Sociol, Rev. 54 776-89 

Binstock, R H., George L. K., eds 1990 
Handbook of Aging and the Social Sciences 
San Diego. Academic. 3rd ed 

Brum, O. G , Jr 1966 Socialization through 
the life cycle In Socializaton After 
Childhood, ed O G Bnm, Jr, S. 
Wheeler, pp. 3-49 New York: John 
Wiley. 116 pp. 

Brown, G. W., Harris, T O. 1978 Social 
Origins of Depression. A Study of Psychiat- 
ric Disorder in Women. London: Tavistock 
429 Pp. 

Burton, L M., Bengtson, V. L. 1985. Black 

dmothers: Issues of timing and continu- 
ity of roles. In Grandparenthood, ed. V. L. 


Bengtson, J F Robertson, pp. 61-77. Bev- 
erly Hills. Sage. 212 pp. 

Caspi, A., Elder, G. H. Jr., Bem, D J 1987. 
Moving against the world. Life-course pat- 
terns of explosive children. Dev Psychol 
23 308-13 

Caspi, A , Elder, G H. Jr , Bem, D. J 1988 

oving away from the world Life-course 
patterns of shy children. Dev. Psychol. 
24:824—31 

Cherlin, A J 1988a. The changing American 

family and public policy See Cherlin 1988, 
1-30 

Cherin, A.J. ed 1988b The Changing Amer- 
ican Family and Public Policy. Wüsslbugfoit 
DC Urban Inst. Press. 261 pp 

Clausen, J. ed. 1968 Socialization and Soci- 
ety. Boston: Little, Brown. 247 pp. 

Clausen, J 1972 The hfe course of individu- 
als. In Aging and Society, vol. 3, A Soctol- 
ogy of Age Stratification, ed. M.,W. Riley, 
M. Johnson, A. Foner, pp. 457-514 New 
York Russell Sage Found. 652 pp. 

Cook, T. F , Jr 1983. Cataclysm and career 
rebirth: The imperial elite. In Work 
and Lifecourse in Japan, D W. Plath, 
pp 135-52. Albany: State Univ. New York 
Press. 180 pp. 

Cowan, P A. 1991. Individual and family life 
transitions: A proposal for a new definition 
See Cowan & Hethenngton 1991. pp. 3-30 

Cowan, P. A , Hetherington, M. ade 1991. 
Family Transitions. Hillsdale, NJ; Erlbaum 
Assoc. 331 pp. 

Dannefer, D. 1988 What's in a name? An 


account of the neglect of variability in the 
study of aging. In Emergent Theories of 
Aging, ed. J E. Birren, V L Bengtson, pp 
356-84. New York Springer 530 pp. 

DeViney, S , O'Rand, A M 1988 Gender- 
cohort succession and retirement among 
older men and women, 1951-1984 Sociol 
Q. 29.525-40 

DiPrete, T A , Krecker, M. L. 1991 Occu- 
pational linkages and job mobility within and 
across organizations. Am Sociol Rev 53. 
26-40 


Dohrenwend, B. P., Dohrenwend, B S. 1969 
Social Status and Psychological Disorder: A 
Causal Inquiry. New York: John Wiley 218 


Pp 

Dohrenwend, B P., Link, B. G , Kern, R., 
Shrout, P E., Markowitz, J 1987. Measur- 
ing life events. The problem of variability 
within event categories. In Psychiatric Epi- 
demiology Progress and Prospects, ed. B 
Cooper, pp 103-19 London Croom Helm. 
413 pp. 

Ebaugh, H.R F 1988 Becoming an Ex The 
Process of Role Ext Chicago. Univ. Chi- 
cago Press. 247 pp 

Eckenrode, J., Gore, S , eds 1990 Stress 
Between Work and Family New York Ple- 
num. 223 pp. 

Eggebeen, D., Uhlenberg, P 1985 Changes 
in the o tion of men's lives: 1960- 
1980 Fam Relat 34.251-57 

Elder, G H Jr 1974. Children of the Great 
Depression Chicago Univ. Chicago Press. 


400 pp. 

Elder, G H Jr 1979 Historical change tn life 
patterns and personality In Life-Span Devel- 
opment and Behavior, vol 2, ed P. B 
Baltes, O. G. Brım, Jr, pp 117-59 New 
York. Academic 348 pp. 

Elder, G H Jr 1985a Perspectives on the life 
course See Elder 1985b, pp. 23-49 

Elder, G H Jr, ed. 1985b Life Course 
Dynamics Ithaca’ Cornell Univ. Press 345 


Pp 

Elder, G. H. Jr , Pavalko, E. K., Hastings, T. 
J 1991 Talent, history. and the fulfillment 
of promise. Psychiatry 54.251—67 

Featherman, D. L, Lemer, R M 1985 
Ontogenesis and sociogenesis. Problem for 
theory and research about development and 
socialization over the liiespan Am Sociol 
Rev 50:659—76 

Featherman, D L., Sorensen, A 1983 Soci- 
etal transformation in Norway and change in 
the life course transition into adulthood Acta 
Sociol. 26:105—26 

Furstenberg, F F , Jr , Brooks-Gunn, J., Mor- 
gan, S P 1987 Adolescent Mothers in Later 
Life. New York’ Cambridge Univ. Press 
322 pp 

Furstenberg, F F , Jr., Condran,G A 1988 
Family change and adolescent well-being A 


LIFE TRANSITIONS 371 


reexamination of U S. trends. See Cherlin 
1988b pp 117—56 

Gee, E. 1986 The life course of Canadian 
women A historical and demographic anal- 
ysis. Soc. Indicators Res. 18:263—83 

Gee, E M. 1987. Historical change in the 
family life course of Canadian men and 
women. See Marshall 1987, pp 265-87 

George, L K 1980 Role Transitions in 
Later Life Monterey, Calif? Brooks/Cole. 


159 p 

George, t K. 1989. Stress, social support, and 
depression over the life course. In Aging, 
Stress and Health, ed. K. S. Markides, C 
L Cooper, pp 241-67 Chichester, En- 
gland: John Wiley 290 pp 

George, L. K , Gold, D. T. 1991 Life course 
perspectives on intergenerational and gener- 
ational connections. Marriage Fam. Rev 
16 67-88 

Granovetter, M 1985 Economic action and 
social structure. The problem of embedded- 
ness Am J Sociol 91 481—510 

Guillemard, A M , van Gunsteren, H. 1991. 
Pathways and prospects: A comparative in- 
terpretation of the meaning of early exit See 
Kohli et al 1991, pp 362-88. 

Hagan, J 1991 Destiny and drift The risks 
and rewards of youth. Am. Sociol. Rev. 
56 567-82 

Hagestad, G O 1990. Social perspectives on 

e life course. See Binstock & George 1990, 
pp 151-68 

Hagestad, G O , Burton, L 1986 Grandpar- 
enthood, life context, and family develop- 
ment Am. Behav Sci. 29.471-84 

Hagestad, G O , Neugarten, B L 1985. Age 
and the lıfe course. In Handbook of Aging 
and the Social Sciences,.ed. R. H Binstock, 
E. Shanas, pp. 35-61. New York. Van 
Nostrand Reinhold. 2nd ed. 909 pp. 

Hofferth. S. L 1985 Children's life course: 
Family structure and living arrangements in 
cohort perspective See Elder 1985b, pp. 
75-112 

Hogan, D P 1978 The variable order of 
events in the life course Am. Sociol Rev. 
43 573-86 

Hogan, D P. 1981 Transitions and Social 

hange: The Early Lives of American Men 
New York. Academic. 232 pp. 

Hogan, D P 1985 The demography of life- 
span transitions. Temporal and gender com- 
parisons In Gender and the Life Course, ed 
A. Rossi, pp. 65-78. New York. Aldine. 
286 pp 

Hogan, D. P, Astone, N M 1986 The 
transition to adulthood. Annu Rev. Sociol. 
12:109-30 

Holmes, T H , Rahe, R. H. 1967. The Social 
Readjustment Rating Scale. J Psychoso- 
matic Res 11°213-18 

Jacobson, D. 1989 Context and the sociolog- 


372 GEORGE 


ical study of stress J Health. Soc Behav. 
30:257—60 

Kerckhoff, A C 1990 Getting Started Tran- 
sition to Adulthood in Great Britain Boul- 
der, Colo: Westview 216 

Kertzer, D I, Hogan, D BP 1988. Family 
structure, individual lives, and societal 
change See Riley 1988, pp 63-82 

Kessler, R C. 1979 Stress, social status, and 
psychological distress J Health Soc. 
Behav. 20.259—72 

Kohli, M., Rein, M , Guillemard, A M., van 
Gunsteren, H ed. 1991. Time for Retire- 
ment. Cambridge Cambridge Univ Press 
398 pp 

Landerman, R., George, L K , Blazer, D G 
1991 Adult vulnerability for chiatric 
disorders. Interactive effects of negative 
childhood experiences and recent stress J 
Nery Mental Disease 179 656-63 

Langner, T S, Michael, S T. 1963 Life 
Stress and Mental Health Toronto Free 
Press of Glencoe. 517 pp. 

Lin, N 1992 Stress. In Encyclopedia of Soci- 
ology, Vol 4, ed E. F Borgatta, M L. 
Borgatta, pp 2096-2101. New York: Mac- 
Millan. 2359 pp. 

Linton, R. 1936 A Study of Man. New York: 
Appleton-Century-Croft. 186 pp 

Marshall, V W., ed. 1987 Aging in Canada 
Markham, Ontario Fitzhenry & Whiteside 
613 pp 2nd ed 

Martin Matthews, A 1987 Widowhood as an 
expectable life event See Marshall 1987, 
pp 343-66 

Mayer, K U 1988 German survivors of 
World War II. The tmpact on the life course 
of the collective experience of birth cohorts. 
See Riley 1988, pp. 211-28 

McAdam, D 1989 The biographical conse- 
quences of activism Am. Sociol Rev 54 
744-60 

McLanahan, S., Bumpass, L 1988 Inter- 
generational consequences of family disrup- 
ton Am J Sociol 94:130-52 

Meyer, J W 1986. The self and the life course 
Institutionalization and its effects In Human 
Development and the Life Course. Multidis- 
ciplinary Perspectives, ed. A. B. Sorenson, 
F E Weinert, L R Sherrod, pp 199-206 
Hillsdale, NJ: Lawrence Erlbaum. 396 pp 

Modell, J 1980. Normative as of Amer- 
ican marriage timing since World War II J 
Fam. Hist. 5 210-34 

Modell, J., Furstenberg, F F., Jr., Hershberg, 
T. 1976. Social change and transitions to 
adulthood 1n historical perspective J Fam 
Hist 1 7-32 

Nicholson, N 1984 A theory of work role 
transitions Admin Sci 29 172- 91 

Nicholson, N. 1990 The transition cycle. 
Causes, outcomes, processes, and forms In 
On the Move. The Psychology of Change and 


Transinon, ed. S Fisher, C. L Cooper, pp. 
83-108. New York: John Wiley 374 pp. 
O'Rand, A M 1988. Convergence, institu- 
tionalization, and bifurcation’ Gender and 
the pension acquisition process Annu Rev. 
Geron. Geriatr 9.132-55 

O’Rand, A. M., Henretta, J. C. 1982. Delayed 
career entry, industrial pension structure and 
early retirement in a cohort of unmarried 
women Am. Sociol Rev. 47.365—73 

O’Rand, A. M, Landerman, R. 1984. 
Women's and men's retirement income: 
Early family role effects Res on Aging 
6 25-44 


Pearlin, L. I. 1983. Role strains and personal 
stress In Psychosocial Stress Trends in 
Theory and Research, ed. H B. Kaplan, pp. 
3-32 New York Academic. 346 pp. 

Pearlin, L. I 1989 The sociological study of 
stress. J Health Soc Behav 30:241—56 

Pearlin, L. I , Lieberman, M , Menaghan, E., 
Mullan, J 1981. The stress process J. 
Health Soc Behav 22 337-56 

Rabkin, J G., Struening, E L 1976. Life 
events, stress, and illness Science 194 
1013-20 

Riley, M. W , ed 1988 Social Structures and 
Human Lives. Newbury Park, Calif: Sage. 
368 pp. 

Rindfuss, R R, Swicegood, G G., 
Rosenfeld, R A 1987 Disorder ın the life 
course How common and does ıt matter? 
Am. Sociol Rev 52°785-801 

Ritzer, G 1989 Sociology of work. A 
metatheoretical analysis. Soc Forces 67 
593—604 

Rosenfeld, R. A. 1992. Job mobility and career 
processes. Annu Rev. Sociol. 18.39—61 

Rossi, A. S 1980 Life-span theories and 
women's lives Signs 6.4—32 

Sorenson, A. B , Tuma, N. B. 1981. Labor 
market structures and job mobility Res Soc 
Strat Mobil 1.67—94 

Srole, L , Langner, T S , Michael, S. T, 
Opler, M K., Rennie, T. A. C. 1962 
Mental Health in the Metropolis: The Mid- 
town Manhattan Study New York: McGraw 
Hill 593 pp. 

Thoits, P. A. 1987. Gender and marital status 
differences 1n control and distress Common 
versus unique stress explanations J Health 
Soc Behav 28:7-22 

Tuma, N B , Sandefur, G. D. 1988. Trends 
in the labor force activity of the elderly in 
the United States, 1940-1980 In Issues in 
Contempory Retirement, ed R Ricardo- 
Campbell, E P Lazear, pp. 38-75. Stan- 
ford, Calıf. Hoover Inst. Press 312 pp 

Tumbull, J E., George, L K., Landerman, 
R., Swartz, M. S., Blazer, D G. 1990 
Social outcomes related to age of onset 
among chiatric disorders. J. Consult 
Clin Psychol 58.832—39 


Tweed, J. L , Schoenbach, V. J , George, L 
K ,Blazer, D G 1989 Effects of childhood 
ntal death and divorce on six-month 
istory of anxiety disorders Br J Psychia- 
try 154,823--28 
Uhlenberg, P 1980 Death and the family J 
Fam. Hist. 5.313-20 
Uhlenberg, P , Eggebeen, D 1986 The de- 
clining well-being of American adolescents. 
Public Interest 82 25—38 
Watkins, S. C , Menken, J. A., Bongaarts, J 
1987. Demographic foundations of family 
change Am. Sociol Rev. 52 46-58 
Weil, F. D. 1987 Cohorts, regimes, and the 
legitimation of democracy: West Germany 
since 1945 Am Sociol Rev. 52.308—24 


LIFE TRANSITIONS 373 


Wheaton, B 1990a Where work and family 
meet: Stress across social roles See 
Eckenrode & Gore 1990, pp 153-74 

Wheaton, B 1990b Life transitions, role bis- 
tones, and mental health Am. Sociol Rev 
55 209-23 

Wheaton, B , Roszell, P 1992. Lifetime mod- 
els of stress exposure Pap. pres Annu 
Meet. Am Sociol Assoc, Pittsburgh, 
Penn 

Winsborough, H H 1980 A demographic 
approach to the life cycle. In Life Course: 
Integrative Theortes and Exemplary Popula- 
trons, ed K W. Back, pp 65-78. Boulder, 
Colo. Westview 173 pp 


Annu Rev Sociol 1993 19 375-99 
Copyright © 1993 by Annual Reviews Inc All rights reserved 


SOCIAL ORGANIZATION AND 
RISK: Some Current Controversies 


Lee Clarke 


Department of Sociology and Center for Environmental Communications, Rutgers 
University, New Brunswick, New Jersey 08903 


James F. Short, Jr. 
Social and Economic Sciences Research Center, Washington State University, 
Pullman, Washington 99164-4014 


KEYWORDS risk analysis, institutions, constructionism, high reliability organizations, disas- 
ters 


Abstract 


The sociology of risk is fast becoming a recognized field of theory and 
research. We consider and evaluate the following key issues, debates, and 
theories in this emerging field: (1) The argument over whether technological 
disasters are different from natural disasters, (ii) the contributions of social 
constructionism to theories of risk, (iii) how recent work on trust and fairness 
can reorient scholarship on risk perception, (iv) the theory of human error, 
and why organizational, institutional, and cultural factors are more important, 
(v) the arguments about organizations that appear to be error-free (i.e. high 
reliability organizations), (vi) the paradox that we must rely on organizations 
for responding to disasters though organizations may be precisely the wrong 
social instruments for such response. Throughout we emphasize the role of 
organizations and institutional contexts in creating, assessing, and responding 
to risk. 


INTRODUCTION 


Sociological concern with risk developed slowly prior to Charles Perrow's 
Nominal Accidents and James Short's American Sociological Association 
presidential address, "The Social Fabric at Risk," both in 1984. Carol 
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Heimer’s 1988 Annual Review of Sociology chapter established the sociolog- 
ical relevance of psychological work on risk estimation and individual decision 
making under uncertain conditions. A large body of accumulated work 
complements Heimer’s review, helping wrestle the intellectual agenda from 
engineering, systems analysis, economics, and psychology. 

Professionally, there certainly 1s a developing sociology of risk. Every year 
since Short’s 1984 address, at least one regular session has been devoted to 
“Risk” at the annual meetings of the American Sociological Association The 
number of articles and books with “risk” in the title, or in the theoretical 
framing of arguments, is increasing. Too, anonymous reviewers of articles, 
books, and grant proposals are now more likely to grant basic premises of 
risk related research than was the case in the past. 

But professional and intellectual advancement do not correspond perfectly. 
Risk scholarship flows from work on technology, natural disasters, individual, 
group, and organizational decision-making, politics, stratification, environ- 
ment, and social impact assessment (Freudenburg 1986, Dietz et al 1993) 
Depending on many areas, and at a young age, the field has no integrating 
theory. In itself this is not damning Even fairly established subfields lack 
unifying theories. Still, there is something vaguely muddled, though not 
entirely incoherent, about the field. Part of the confusion comes from 
insufficient attention to defining and using key concepts; another part is lack 
of agreement on the field’s orienting debates. Here we review some of those 
debates. 

Perhaps the most intellectual progress has been made with approaches 
concerned with how risk identification, evaluation, management, creation, 
and social meaning (cf Renn 1992) are shaped by structural and institutional 
contexts. The topics we chose to cover illustrate the importance of such 
contexts. We suggest other ways the field can advance in the conclusion. 


ARE TECHNOLOGICAL HAZARDS SPECIAL? 


One argument among scholars and practitioners concerned with hazards and 
related risks focuses on differences and similarities between technological and 
natural hazards. Disaster research has been a professional area for 50 years 
(Sorokin 1942, Fritz 1961) and has been reviewed here twice (Quarantelli & 
Dynes 1977, Kreps 1984). Recently, some have asked whether technological 
and natural hazards (or risks or disasters) are in fact different. Kasperson & 
Pijawka (1985:12, 17) think they differ because “unfamiliarity,” “newness,” 
and “lack of accumulated experience” create different “management prob- 
lems” for organizations that must ameliorate damage (see also Cvetkovich & 
Earle 1985); technological disasters also elicit "different patterns of response" 
from the public. 
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Couch & Kroll-Smith (1985, 1991, Kroll-Smith & Couch 1990) initially 
argued strongly for difference. Their key distinction rested on the time 
dimension of a threat. Natural calamities, they claimed, advance in a 
reasonably clear set of stages—‘order, to chaos, to the reconstitution of 
order —that ended in social system re-equilibration (Kroll-Smith & Couch 
1990:163, cf Dynes 1970). The chief equilibrating mechanisms are the 
altruistic, “communal associations which function to reassert the power of the 
group over nature by reconstituting routines that reinvest daily life with a 
sense of permanence and predictability” (Kroll-Smith & Couch 1990:165). 
By contrast, sequencing of technical disasters 1s less linear, and appropriate 
responses are not as clear: “The community will lack a clear-cut and 
unambiguous reference point from which to collectively determine its identity 
as a community in crisis (Couch & Kroll-Smith 1985:567). Ambiguity 
regarding whether a disaster agent is harmful, and indeed whether a disaster 
has even happened, make therapeutic communities less likely to form 
(Cuthbertson & Nigg 1987). 

Recently Kroll-Smith & Couch (1991) abandoned the distinction, arguing 
"the real issue is not the quality of the disaster agent per se, but whether or 
not it significantly alters the relationships between a community, its built, 
modified or biophysical environments, and how people interpret and experi- 
ence the changes in those environments” (Kroll-Smith & Couch 1991:361). 
They name this view the ecological-symbolic perspective and base it on two 
ideas. “People exist in exchange relations with . environments" and “disrup- 
tions in the ordered relationships between individuals, groups and communi- 
ties, and their. environments, are labeled and responded to as hazards and 
disasters" (Kroll-Smith & Couch 1991:361). The advantage of the conception, 
they say, is it "encourages us to shift our view of [event sequences] from that 
of objective markers to subjective apprehensions of the place a person or 
group occupies in the disaster cycle" (Kroll-Smith & Couch 1991:363). This 
conception 1s more sociological than attributing inherent meaning to disaster 
agents. While radioactivity and hurricanes are dangerous, they do not 
necessarily lead people to think or act in a certain way. 

This is the view long held by disaster researchers. Russell Dynes and E. 
L. Quarantelli, who have been among the pioneers of systematic disaster 
research, have maintained for over 20 years that the qualities of the disaster 
agent are sociologically irrelevant (Quarantelli & Dynes 1978). Listen to 
Dynes (1970:51): “While disaster agents are socially disruptive, one cannot 
understand this disruption solely from knowledge about the agent." Now 
Quarantelli (1992): “a disaster can be identified only in terms of some features 
of a social occasion, that is, some characteristics of the individuals and groups 
reacting in the situation." 

In other words, the meaning 1s in the response, as symbolic interactionists 
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put it. Looking to the established disaster literature might tempt one to 
conclude, with Dynes, that the distinction between technical and natural 
hazards “has been made more recently by risk analysts looking at technology 
and by some sociologists who, like Columbus, think they have discovered a 
new world” (personal communication). 

Erikson (1990, 1991) suggests another way of categorizing hazards that 
distinguishes between toxic and nontoxic threats. He claims, with typical 
elegance, that toxic hazards, many of which involve technology, prompt 
victims to use different interpretive frames to make sense of disasters. Toxic 
contamination renders seemingly innocuous or even beneficial things (the air 
we breathe, the water we drink, the sea and the soil that nourish the food we 
eat) quite dangerous. Further, the extent of physical damage, and the cognitive 
boundaries (Zerubavel 1991) of Three Mile Island, Bhopal, and Chernobyl 
are more uncertain compared with devastating hurricanes and floods. Some 
toxic risks can effectively threaten forever (nuclear waste, nuclear weapons). 

These remarks suggest several ideas about the argument over how to 
characterize hazards. First, as disaster researchers insist, natural hazards 
involve human and technological components in etiology and management. 
The poor often reside in vulnerable buildings and locations (e.g. flood plains), 
probably because they have little choice in such matters (Tierney 1989, 1992). 
Damage to residential units in recent California earthquakes disproportionately 
displaced poor people and minorities (Bolton et al 1992). The more affluent 
have more choices and sometimes choose—presumably for lifestyle reasons— 
to build in places, and with materials, that are vulnerable to natural hazards 
such as fire and mud slides. While weather patterns produce hurricanes, 
damage occurs primarily because populations are concentrated, building 
regulations are poor, power lines are not buried, etc. 

Second, humans, not physical environments, build technical systems that 
can melt down, poison, and incinerate. Perhaps this matters more for practical 
than intellectual reasons, but it's not trivial. Thus failing to distinguish 
hazard-types in some way obscures choice and agency in the creation of 
technological risks. Also, few natural disasters are entirely unexpected, while 
technological ones are more likely to surprise, and baffle, operators, planners, 
decision makers, and regulators. 

Third, the interesting questions concerning how to characterize hazards and 
risks are empirical, not definitional. Three key sets of these questions concern 
(i) the social construction of blame, (1i) ambiguity in determining who the 
victims are and how much damage they have suffered, and (111) the probability 


! After the Oakland/Berkeley Hills fire in the fall of 1991 there was an effort, which ultimately 
failed, to require fire resistant roofing material 1n reconstruction. Although causal attribution 1s 
difficult, the wood products industry lobbied against the effort. 
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that victims’ grievances gain attention from organizations. Perhaps future 
research will discover an association between toxic risks and the likelihood 
to blame human, or organizational, agents. Of course, the definition of toxic 
is also constructed, and some agents are more credibly toxic than others, 
depending on who does the defining. But just because something is socially 
constructed doesn’t make it less real. Theorizing blame means understanding 
the circumstances under which blame develops, and understanding how 
institutional and organizational forces shape moral vocabularies of blame and 
distributions of power. These are matters of research, not contentious 
presumption. 


RISK AND SOCIAL CONSTRUCTIONISM 


Social constructionism is important in risk scholarship in ways other than the 
argument over how to classify hazards. Constructionist arguments reject 
notions of objective reality, asserting fundamental ambiguity in social relations 
and meaning (Berger & Luckmann 1967, Brehmer 1987, Gamson & Mod- 
igliani 1989). On accepting those assumptions, the task is to explain why and 
how realities are constructed in particular ways, which inevitably raises the 
problems of interests, power, and social conflict. 

Hilgartner (1992) points out that treating perception—psychology’s dom- 
inant interest—as a dependent variable ignores prior issues. The usual 
approach is to study some controversial case (e.g. a nuclear plant) or general 
risk perceptions. We then investigate how members of the public react to 
the plant, what they fear and why, or in what ways those fears motivate 
action. Consider the widelv cited article, “The social amplification of risk,” 
by Kasperson et al (1988), which develops an engineering-type signal 
amplification and attenuation model of risk perception. It is a broader 
argument than usual studies of perception because of the way it incorporates 
cultural processes and institutional arrangements in accounting for how risk 
information is transmitted and fears are formulated. But the argument’s 
focus is still how responses to a “risk event” are amplified or attenuated 
(cf Rayner 1988). Such an approach accepts as given the “risk object,” as 
Hilgartner calls it (risk objects are technologies or activities to which social 
agents ascribe the ability to cause harm). The thing-to-explain remains the 
malleability of risk perception. 

Constructionism upends the issue, asserting the malleability of the msk 
objects themselves. Such objects might be dangerous in some objective 
sense—think of a nitroglycerin factory in San Francisco—but the basic 
sociological task is to explain how social agents create and use boundaries to 
demarcate that which is dangerous. Definitions of danger, as Douglas (1966, 
Douglas & Wildavsky 1982) argued some time ago, are socially constructed. 
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Why then focus on one thing as the cause of some harm (or potential harm), 
when multiple attribution is, at least in principle, possible? 

Consider that about 45,000 people die every year on US highways. The 
usual question asked of such data is why the public worries about a one in a 
million cancer risk from toxic chemicals, when there are obviously far more 
dangerous things to worry about. But another question is. What causes the 
carnage? The obvious answer focuses on drivers, or the oft-heard explanation 
for nearly all accidents, “human error’ (on which more below). From a 
constructionist perspective, however, there 1s no objective reason to focus 
solely on car drivers. We could just as reasonably concentrate on road design, 
lack of public transportation, or automobile engineering. Still another con- 
structionist angle might concentrate on how the 45,000 figure is created in 
the first place (1.e. how they are counted, defined, or how the data are collated 
to create the "fact"). The most obvious manifestations of reality do not indicate 
functional necessity. 

Social constructionism helps unmask appearances of objectivity. Though 
hardly novel to sociologists, this corrective is important in policy arguments 
about how and why resources ought to be allocated for risk-amelioration. 
Organizations and elites who make such decisions, especially in large 
corporations and federal regulatory agencies, still hew to the line that the 
problem with rsk acceptability is insufficient and low quality information. 
The normative theory behind this line of thought holds that if only the reality 
can be ascertained, prescriptions for action will be self evident. 

While no one would deny we need more and better information, the view 
has major problems. One is the assumption, palpably false, that information 
can resolve value conflicts. Another is that organizations often have too much, 
not too little, information available in decision situations (Feldman & March 
1981) This is especially a problem under conditions of ambiguity because it 
presents decision makers with the extra burden of determining the relevance, 
reliability, and validity of information without established rules for doing so. 
Problems of bounded rationality, intrinsic to large organizations and complex 
problems, are not attenuated with information overload. 

Probably the central contribution of constructionism is that it raises the 
question: Who does the constructing? Gusfield's (1981) dissection of “the 
myth of the killer drunk" informs much work on the social construction of 
risk. He intended his argument, which was based on a thorough dissection of 
the evidence and argument concerning the drunk-driver issue, as a contribution 
to rhetorical and dramaturgical analysis. Indeed, he averred that he avoided 
the role of "interests, values, and sentiments of particular groups, classes, or 
occupations" in producing drinking-driving knowledge (Gusfield 1981:187). 
But as Gusfield recognized, killer drunk rhetoric would have little influence 
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were it not for the power of institutional actors. Those actors—all organiza- 
tions, incidentally—created the popular (and politically important) notion that 
drunk driving is a new, and especially devastating, social problem and that 
individuals with deep moral failings are responsible for ipo 

Constructionist arguments—again, in contrast to psychology and also 
economics—tend to focus on how power works in framing terms of debate 
about risks (see, e.g., Stallings 1990). This contribution is important because 
of assertions—the normative character of which often goes unrecognized— 
that public fears regarding technical risks are irrational. These fears, some 
argue, are whipped up by a sensationalistic media and result in poor public 
policy. Embedded in such critiques 1s a theory of how society works and how 
risk decisions are made That theory says: (1) policies are a more or less direct 
reflection of public preferences, (ii) public preferences are created by 
newspapers and television, and (iii) hazards or potential hazards are assumed 
to be primarily technical matters and thus best left to experts (see e.g. Starr 
1969). 

Such a theory is in many respects simply wrong. À considerable body of 
evidence demonstrates that the biggest influences on social policy come from 
variously organized interest groups, not from an undifferentiated *public."? 
Also, the theory neglects expert biases and limitations in technical knowledge, 
ones widely recognized by social scientists but often denied ın policy making 
arenas (Freudenburg 1988, Shrader-Frechette 1992). The self-correcting ethos 
in science presumes theoretical and empirical error, but social policies usually 
do not; such policies also do not usually permit experimentation that might 
determine either degrees of uncertainty or calculation of risk. Aside from 
these objections the "theory" of direct democracy is little supported (Edelman 
1988). For such a theory to be convincing it would, at a minimum, have to 
establish the mechanisms through which public preferences are translated into 
policy. Available work on political power and influence demonstrates that 
such direct democracy 1s a fantasy (Jasanoff 1986). 

Organizations, elites, and professions (including experts) are also the main 
institutional actors who make choices among technologies (including neglect- 
ing to develop them). For instance, Jasper (1990, 1992) shows how French 
nuclear elites set that country on the atomic path, defeating alternatives along 
the way. Similarly, Heimer (1992) shows how hospitals choose the kind and 
quality of information to transmit to parents of gravely ill children, shaping 


*Note that constructionism does not deny driving drunk can pose an objective risk. It denies 
that the way to theorize such risks 1s obvious 
s 1$ the sense ın which the term is almost always used, cf Edelman (1988) on “constructing 
the public " 
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parents’ perceptions of illness. As Hilgartner (1992:51) puts the point: those 
who “devote sustained attention to constructing facts and machines, laws and 
regulations, organizations and management systems, research programs and 
data collection systems, risk objects and networks for controlling them” have 
the power to choose. 

When constructionist arguments go beyond simply pointing out that the 
world is not naturally given, they tend to highlight the processes through 
which action happens and meanings are constructed. The corollary is that 
constructionism encourages counterfactual thinking; that is, paying attention 
to what could have happened, but did not. In explaining how technologies, 
risks, and social organizations are created, theories can account for how 
different interests shape raw materials into something recognizably real. By 
looking askance at official representations of reality we realize they are not 
the only possible social constructions; alternative constructions are always 
possible. As Abbott (1990:142) holds in another context, “most possible 
events don’t happen.” This claim implies that theories need to explain why 
alternative possible realities fell away, were defeated, or never arose in the 
first place (Royce 1985). Such conceptual and empirical exercises thwart 
teleological reasoning, and make explicit a theory’s causal assumptions. 


FAIRNESS AND TRUST 


Probabilities of technical failure may or may not have increased over the past 
30 years, but the levels and intensity of social conflict over those probabilities 
certainly have. Opportunities for communities and publics to express their 
preferences, and their fears, have expanded. There are more legal, organiza- 
tional, and political mechanisms available to use in social struggles over risk. 
This not to say that organizations (and the elites who attempt to control 
organizations) must bend to the whims, or wisdom, of their constituents and 
the general public. But certainly public acceptance of Nuclear Regulatory 
Commission claims, after a radiation release, that “there is no risk to the 
public,” is lower than when the Atomic Energy Commission (the NRC’s 
predecessor) made similar proclamations. Few would gainsay that threat 
perception is now more likely to galvanize public organization and complaint. 
This enhanced probability of challenge has driven much of the interest in risk 
perception (and possibly vice-versa). 

Much early writing on risk, especially by physical scientists and engineers, 
addressed an apparent discrepancy between objective and perceived risks (see, 
most representatively, Starr 1969; more recently, see Cohen 1985; Douglas 
& Wildavsky 1982). Since psychology produces experts in perception, 
scholars in that field were fated to make the initial contributions. Many 
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psychologists accepted the objective/perception formulation of the problem, 
at least early on. After discovering (or creating) the idea of chasmic gaps 
between perception and reality, the focus turned to the media and also, to 
some extent, to local protest groups as sources for the public's out-of-propor- 
tion risk assessments. It was perhaps inevitable that there would be talk of 
public irrationality and NIMBYs, the latter chiefly an epithet aimed at 
protesters of official definitions of reality. The political implications of 
research on risk perception have not been lost on policy, and risk, creators. 

Politics aside, the response from social science has been four fold. One, 
from social constructionism, claims that the notion of objective risk is 
fundamentally flawed to begin with, as noted above. A second response has 
been to normalize the irrationality by showing that hardly anyone makes 
decisions rationally. From this view, while the public may judge on 
nonrational grounds (Slovic 1987), so do corporate managers (Mintzberg 
1973, March & Shapiro 1987) and government regulators (avowed politicians 
were never in doubt). Even experts (Lichtenstein & Fischhoff 1977, Mazur 
1985, Freudenburg 1988) are prone to extrarational biases such as organiza- 
tional pressures and professional ideologies (Dietz & Rycroft 1987). À series 
of case studies demonstrates how experts and decision makers become 
intransigently committed to beliefs about what is and is not risky, while 
non-experts seek new information in uncertain situations (Levine 1982, 
Edelstein 1988, Brown & Mikkelsen 1990). 

A third argument holds that non-experts are in fact rational, but in 
nonobvious ways that are neglected by traditional approaches that center on 
probability theory and benefit/cost logic. Probability theory counsels concern 
about what is most probable (and thereby stakes a claim for what is important 
to worry about); benefit/cost reasoning adequately registers damages that are 
obvious (e.g. body counts, morbidity rates). Perrow (1984) proposed the idea 
of social rationality, which expands the range of phenomena that risk analysis 
should be concerned with. Specifically, research places catastrophic potential 
high in hierarchies of public worry. Compared to likely events (auto 
accidents), catastrophes (nuclear meltdowns) wreak social and cultural dam- 
age, sometimes without immediate, visible damage (Love Canal) and some- 
times with it (Chernobyl). Work on natural hazards and technological risks 
and uncertainties affirms the importance of social disruptions—to community 
structure and ethos, hopes and expectations for family futures, institutional 
legitimacy, collective responsibility—that are hard for markets to price, hence 
hard to include in a benefit/cost calculus. At bottom, disdainful portrayals of 
non-expert judgments as irrational emotion portray individuals as social atoms, 
neglecting larger social and cultural contexts. 

A fourth response reframes the issue of perception altogether. From this 
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view, it is not probabilities and risks that so concern people, but fairness, 
competence, and responsibility. Rayner & Cantor (1987:4) make the point 
that most people probably do not worry about small differences in 
probabilities when faced with risk management decisions. Rather than 
technical issues, people worry most about how risky decisions are made: 
Are they made fairly? Whose interests are served? Are costs and benefits 
distributed equitably? Why this decision and not another? Who has assumed 
authority in the situation? 

Freudenburg (1993) addresses the same issue differently, drawing on Weber 
and Durkheim to make a sociological argument about trust and risk. Using 
several sources of evidence, he finds that age, sex, political party affiliation, 
and self-assessed political ideology do not predict risk perception very well, 
contrary to expectations. Rather, the best predictors are the degree to which 
people trust science and business and governmental ability to manage danger 
(two main sources of evidence are surveys on nuclear waste sites). He 
emphasizes, with others (e.g. Douglas & Wildavsky 1982, Wildavsky 1988), 
the apparent paradox that science and technology can engender distrust, even 
as they extend and enhance modern life. Rather than argue a particular side 
of the paradox, however, Freudenburg resolves it theoretically. Increasing 
social and technical complexity “elevate the probability that some key portions 
of the system .. cannot be safely counted on." Modernity also entails increased 
lıfe expectancy. The division of labor, though a source of riches, increases 
vulnerabilities to others' failures to fulfill their responsibilities. He proposes 
the word "recreancy" as a relatively unloaded term to refer to such failures. 
The issue “is at least in part the risk that socially consequential actors will 
fail to carry out their duties in the necessary ways."* Note the argument is 
not that the division of labor increases probabilities of death and destruction. 
It is, rather, that expected length and quality of life are more vulnerable to 
others' failures. While our great-grandparents might have expected to live to 
60, we now expect to live to 85 years of age, so we seem to be better off 
than our ancestors. Still, if someone's accident shortens our lives to 70, levels 
of distrust will go up. 

Freudenburg's theory would be stronger were direct evidence available that 
people worry more about villainy and recreancy than technology, potential 
catastrophes, or toxic materials. Also, we need more work on the dynamics 


*Future work on recreancy can differentiate between fiduciary responsibility (a necessary, 1f 
not sufficient, condition for fairness), technical competence, and expectations concerning the 
moral and social orders Each of these meanings of trust (Barber 1983) may have different 
significance for people who must trust experts and organizations, or who are being asked to trust 
them Many may care only that a key part of the system can't be "counted on." From the point 
of view of experts and organizations trying to establish grounds for legitimacy, the distinctions 
are important. 
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that increase the risk of recreancy. Freudenburg notes that greater differenti- 
ation diffuses responsibility, including responsibility for ensuring systems are 
safe. Elsewhere, he and Pastor (1992) outline a theory of how regulatory 
agencies engender diminution of public trust. Another possibility might be 
the following. Since the utility of probability theory is ambiguous for most 
individual risk decisions, it is sensible for people not to become probability 
experts or formal risk analysts. Task differentiation creates specialists in such 
matters, which brings claims for legitimacy, grounded on superior knowledge 
and training. But failures, scandals, and a lay understanding of bounded 
rationality lead to distrust, sometimes cynicism (Lipset & Schneider 1983). 

Expectations about the quality of life, including notions of the right to 
control it (a strong ideology), have risen. We are, as Freudenburg claims, 
more interdependent with others and hence more vulnerable to their failures, 
whatever the reasons for such failures. Risk management organizations and 
institutions almost always assert they can control risky systems—even when 
such control is uncertain—and hence that people should trust them. When the 
inevitable accident (or catastrophe) happens, risk managers rarely volunteer 
their responsibility for the damage or recognize that factors that are difficult 
to measure (as mentioned above) might suffer. More common are public 
relations campaigns, marked by cliches ("life is a risk") and denunciations 
(“Chicken Little was scared too"). People know public relations campaigns 
have more to do with selling products than communication, and salespeople 
do not command high levels of trust. 

Theorizing about fairness, trust, and recreancy places institutional legiti- 
macy at the forefront of sociological risk analysis. Such theorizing thereby 
highlights two things often neglected by theories of the mismatch between 
public perceptions and objective risks. First, with regard to publics, individ- 
uals and their decision procedures are placed within institutional contexts, 
thus drawing our analytic focus to bases of association from which people 
denve meaning and a sense of community (Kunreuther et al 1985, Hadari 
1987). A focus on individual risk perception easily neglects organizational 
and institutional contexts within which individual perceptions are shaped. 
Studies of risk perception profit by focusing on these contexts, and on 
institutional processes of decision making (Vaughan 1990, Heimer 1992). 

Second, concentrating on institutional legitimacy and trust suggests a 
research agenda that emphasizes decision processes 1n powerful organizations, 
as well as among powerful decision makers. It is those processes, and those 
organizations, rather than an amorphous public, that allocate resources toward 
and away from risks and technologies. It is organizations, mostly, that set the 
terms of debate concerning risk acceptability. And it is organizations, and 
their putative masters, that make choices about risk that often have im- 
plications well beyond their immediate environs. Indeed, theory and research 
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on externalities and social costs, long the province of economists, should be 
incorporated into risk scholarship (Freudenburg 1986). 


HUMAN ERROR 


The most common explanation, especially from officials charged with 
managing risky systems, for why organizations and technology break down 
is human or operator error. Consider, for instance, oil tanker accidents, a 
topic that has received some scholarly and popular attention since the wreck 
of the Exxon Valdez in 1989.? Tasca (1989:Ch. 1) notes that “oil representa- 
tives and oil regulators frequently claim that 80 percent of significant marine 
accidents are caused by human error.” In November 1989 a Coast Guard Rear 
Admiral, discussing the Exxon spill at a conference in New York, declared 
“the problem here was the captain.” Similar declarations are common in all 
industries. 

“Human error” explains sociotechnical failure by holding that individuals 
are mcompetent, or poorly trained, or confused, or do not follow the rules. 
According to this logic if individuals can be fixed or replaced, four out of five 
marine accidents would not happen. For an important review and synthesis of 
work on human error, see Reason (1990); for a sociological critique, see Perrow 
(1983). Here we mention the key sociological issues regarding the idea. 

Certainly humans err, and often patterns appear in how and why those 
errors occur, but accidents usually result from factors other than individual 
characteristics. Hirschhorn (1984:79—80) points out that: 


Manuals, protocols, and job instructions may dictate a range of procedures, but 
in the real world, errors of omission and commission may be caused by job 
turnover, the stress of shift work, backbiting among workers, the pressure to 
complete a maintenance job and return the system to service, a supervisor out 
sick, disgruntled maintenance workers, an operator who 1s ill or asleep at the 
controls—in short, all the contingencies and events of human living. We often 
find the tendency toward error exacerbated by the social organization of jobs and 
skills and the information relationships that shape cooperation at work. 


Heimer (1988) adds to this list of “contingencies” the fear of being fired. 
Further, management influences how cognitive premises are set for workers. 
As Freudenburg (1992:19) puts it: "Even if no one in the organization has 
ever said the workers should take on unnecessary risks — corporate leaders 
rarely complain about employees who do too much for the firm, more often 
voicing complaints about those who do too little." 


«Human error" 1s not singular to marine accidents [t 1s also commonly found in nuclear 
accidents, airline crashes, train derailments, and 80 on 
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In the preceding remarks one can see humans erring, but because of 
social structure and culture rather than individual failures. The intellectual 
problem is that as a theory of sociotechnical breakdown, human error 
presumes more than it explains, obscuring the complexities of interaction 
between humans, machines, and organization. We learn more about how 
risks are produced by theorizing organizational factors such as production 
pressures, managerial expectations, and regulatory effectiveness than by 
focusing on human error. 

We would be remiss not to mention a couple of the consequences of blaming 
accidents on operator error. It deflects attention from systemic failures and 
managerial mistakes. For more extensive treatment of the causes and 
consequences of human-error-as-ideology see Perrow (1986a), Tasca (1989), 
and Clarke (1992). Moreover, by off-loading nearly all responsibility to 
individuals, human error helps keep systems running that might, in fact, be 
error-prone. 

If the theory of human error is so weak, why is it produced and promulgated 
in the first place? At least in the maritime industry, Tasca shows how the 
standard operating procedures for accident investigations in the National 
Transportation Safety Board and the US Coast Guard give rise to the notion 
that “human error” is the main problem in marine accidents.° Further, in 
accident investigations and regulatory issues, the Coast Guard is more a 
confederate than a regulator of shippers. “Representatives of shipping firms 
and oil companies," Tasca demonstrates, "comprise the majority of ‘outside’ 
members of the Coast Guard's formal advisory committees." The analytic 
procedures of the National Tansportation Safety Board (the United States’ 
major investigator of transportation accidents) prevent investigators from going 
much beyond attributions of human error in final reports. NTSB investigations 
focus mainly on the types of human error responsible for accidents, rather than 
possible underlying causes of the errors. "Readers [of these reports]," writes 
Tasca, "are rarely invited to focus their attention on . . . shipboard equipment, 
maintenance practices, vessel design, or shoreside management." Tasca points 
out that the Coast Guard bears legal responsibility for "safe and healthful 
working conditions" on US vessels, so working conditions onboard are not 
often mentioned in Coast Guard accident investigations JH may be no accident 
that the human error theory of system breakdown places responsibility on 
system operators and little or none on the system itself. 


“The facts and quotes in this paragraph come from Tasca (1989. Ch 5). 

"They were mentioned in the NTSB's report on the Exxon Valdez, "Marine Accident Report, 
Grounding of the US Tankship Exxon Valdez on Bligh Reef," Prince William Sound Near Valdez, 
Alaska, March 24, 1989, National Transportanon Safety Board, PB90-916405 
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NORMAL ACCIDENTS 


Perrow’s (1984) explanation for how and why accidents happen goes well 
beyond human error by theorizing failures as a function of how organizations 
are configured (see also Turner 1976, Gephart 1984). Perrow classifies 
systems in terms of how system components interact (varying from complex 
to linear) and degree of coupling (varying from tight to loose). 

Linear interactions are those interactions of one component in the DEPOSE 
system (Design, Equipment, Procedures, Operators, Supplies and materials, 
and Environment) with one or more components that precede or follow it 
immediately in the sequence of production. Complex interactions are those 
in which one component can interact with one or more other components 
outside of the normal production sequence, either by design or not by design 
(Perrow 1984:77—78). 

Tightly coupled systems "have more time-dependent processes" and their 
sequences "are more invariant," they have invariant production sequences and 
“little slack” (Perrow 1984:93—94). Linear interactions are expected, familiar, 
"quite visible even if unplanned"; complex interactions are unexpected, 
unplanned, and "either not visible or not immediately comprehensible.” 
Perrow cross-classifies interaction and coupling and finds that, while all 
systems can fail, catastrophes are more likely when systems are tightly coupled 
and interactively complex. 

Perrow (in press) extends the argument by considering why, if accidents 
are inevitable, catastrophes are rare. "Since catastrophes based on system 
accidents require joint probabilities [of failure], both of which are rare, they 
are truly rare," he says. This helps clanfy the theory, although without 
specifying the conditions under which joint probabilities will occur, we still 
lack a theory of catastrophe (as Perrow grants). But he does suggest that the 
following factors, here listed with examples, “affect the frequency of small 
errors and thus the potential for serious accidents and catastrophes": experi- 
ence with scale (size of chemical plant operations), experience with critical 
phases (airliner takeoffs and landings), information on errors (US nuclear 
industry controls most safety data), propinquity of elites to systems (Congress 
members must fly), organizational control over members (aircraft carriers are 
total institutions), organizational density of the environment (airline industry 
is rich with actors concerned with avoiding blame for accidents). 

The power of Normal Accidents is in locating the source of risk in 
organization per se. It is a structural perspective that denies much human 
ability to manipulate social systems, with the exception of choosing to 
implement systems with catastrophic potential in the first place. Once such a 
system is chosen, human efforts are limited mainly to fine tuning and damage 
control. 
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HIGH RELIABILITY ORGANIZATIONS 


The challenge to normal accident theory emanates from research on apparently 
high reliability organizations (HROs). 8 We lack a precise definition of high 
reliability organization, but the organizations in the studies are complex, 
tightly coupled, and high hazard, yet accident free. This, of course, directly 
contradicts what normal accident theory would predict. A close reading of 
the HRO work points the way toward resolving the paradox but raises 
questions about the criteria by which organizations may be judged highly 
reliable (high reliability = safety). 

La Porte & Consolini (1991) note that HROs have “very clear, well-agreed- 
upon operational goals” and “consensus is unequivocal” (La Porte & Consolini 
1991:35, 24). They enjoy abundant resources so short-term efficiencies can 
be neglected for the sake of reliable operation. They are also characterized 
by human redundancy and a highly functional, highly flexible division of 
labor. While in routine operations decisions are bounded by well-formulated 
and tested SOPs, HROs “avoid crippling operational failures" by shifting “to 
a basis of functional skill" (La Porte & Consolini 1991:32, Rochlin 1989). 
In other words when the going gets tough, HROs get flat (lines of 
responsibility blur) so that skill and knowledge rather than bureaucratic 
authority drive decision making. 

HROs use “internal cross-checks on decisions, even at the micro level, and 
fail-safe redundancy .” to determine when “critical decisions are timely and 
correct” (Rochlin 1988). Training and monitoring encourage a culture of 
responsibility and accountability (see also Tamuz 1988). HROs achieve a high 
level of social control by limiting influences from environments external to 
the organization. HROs “come close to meeting the conditions of closed 
rational systems, i.e. a well-buffered, well-understood technical core requiring 
consistency and stability..” (La Porte & Consolini 1991:24). 

The main research sites have been two aircraft carriers, air traffic control, 
and the Diablo Canyon power station in California.” Though HROs are a 
“small group of organizations” (Rochlin et al 1987), the work is important. 
HRO researchers claim to document decentralization amid bureaucratic 


5Progenitors were Todd La Porte, Karlene Roberts, and Gene Rochlin. Morone & Woodhouse 
(1986), Weick (1987), Tamuz (1988), and Wildavsky (1988) are also concerned with high 
reliability Sagan (1993°Ch 1) thoroughly considers all available work on the topic (1993:Ch 
1) 

"Scholarly work on the Diablo Canyon power station is now appearing (La Porte & Thomas 
1991) but is not sufficiently developed to discuss in detail The La Porte & Thomas paper attributes 
the station’s "effective, very reliable and safe operations” to a commitment to quality and strong 
potential regulatory sanctions The table of contents of a book in press (Roberts 1993) suggests 
HRO research will provide more data on electne power grids, nuclear plants, microcomputer 
firms, and atr traffic control. The chapters were not available for review 


390 CLARKE & SHORT 


centralization, autonomy amid dependency, and flexible decision-making 
amid autocracy. HROs are sometimes highly rigid, sometimes well nigh 
democratic hierarchies, thus contravening extant theory, from contingency to 
power models. Further, La Porte & Consolini note (1991:28) that organi- 
zation theory has not seriously considered "how the challenge to be highly 
reliable alters decision-making strategies." Put more generally, HRO research 
raises the important question of how decision processes vary systematically 
according to the extent and intensity of environmental demands for safety. 

While HRO work raises important questions, the criteria by which 
organizations are classified as HROs are vague.!! It is not clear how to 
distinguish between high reliability organizations and trial and error organi- 
zations, as HRO theorists call them, independently of disaster frequency. That 
15, HRO studies interpret the absence of low frequency events (such as 
meltdowns) as evidence of high reliability. HRO theorists grapple with the 
problem in different ways. Roberts (1990:160) proposes calculating a ratio of 
the number of actual failures to the number of possible failures. Such a 
measure would be ideal, though gathering data for the denominator would 
probably be impossible for most industries and organizations. Rochlin (1993) 
proposes that “an organization . can be said to ‘successfully’ deliver “high 
performance’ and ‘high reliability’... if the public being served judges it to 
have provided both an adequate level of service and an acceptable level of 
safety and reliability.” This definition, however, would reorient HRO 
research. Rather than data on failures, we would need data on risk perception 
and recreancy; rather than high reliability organizations, the research object 
would be high legitimacy organizations. 

Aside from conceptual problems, the Naval vessels that to date provide 
most of the project’s output are systems in semi-simulation, not operational 
mode. A fully operating aircraft carrier 1s a system 1n combat, or facing the 
threat of combat. Two incidents—the grounding of the USS Enterprise and 
the USS Vincennes’ shootdown of Iran Air flight #655 during the Iran-Iraq 
war—1eported by core members of the HRO group suggest that such 
conditions can be important. 

The Enterprise suffered hull damage while sailing through clearly marked, 
well-known shallow waters. Roberts' painstaking analysis shows multiple 
definitions of reality were spawned but not checked; some officers knew the 


Contingency theorists in organizational sociology have long maintained that centralization 
and decentralization can exist in the same organization in different units, HRO research claims 
to find the same highly centralized units decentralizing when tasks demand it. 

N Another problem with the work presented to date is that the data do not provide the most 
conservative test of the claim that the organizations are "nearly error free." Clarke is writing an 
article that considers these problems in more detail 


ORGANIZATION AND RISK 391] 


vessel was on the wrong course but failed to take responsibility for informing 
the commanding officer. "The norm of ‘don’t tell the old man” was strong 
here” (it is always strong in military situations), and it prevented reliable 
cross-checking and distributed responsibility. 

The facts are clear in the Vincennes case (Rochlin 1991), where 290 
civilians died. The flight was scheduled, and Vincennes’ officers had the 
schedule but did not see it on the list. “Various personnel” reported the plane 
was squawking a code that identified it as an Iranian F-14, but recorded 
data—from the Vincennes—show it never did. (Iranian fighter codes were 
transmitted from other sources.) Officers testified that 655 was descending, 
indicating a possible attack, but in fact it was ascending. The flight was 
tracked “almost from the moment of its takeoff,” but the "[Combat Information 
Center] aboard the Vincennes |was] one of confusion and disorder." A 
companion vessel, the Sides, correctly identified the aircraft and turned away. 
The most likely explanation for the disaster is that an attack scenario was put 
forth, and through some process of group-think (the Navy called it "scenario 
fulfillment"), became defined as real. Later, Navy investigators focused only 
on the Commanding Officer and found the shooting justified (he was given 
a medal). As Rochlin points out, the larger, neglected, issue was “how the 
situation had developed so badly and wAy the information being provided was 
so skewed from reality." 

Neither of these cases document highly reliable organizations, as Roberts 
and Rochlin know. There was massive organizational failure on the Vincennes, 
and the subsequent Navy investigation did not show organizational learning. 
Rochlin points out that cruisers like the Vincennes are automated in ways that 
thwart the human networks that produce reliability. This case suggests combat 
is indeed an important variable in influencing the reliability of combat oriented 
systems. 

HRO research underscores the crucial task of specifying the conditions 
under which a theory will hold. Perhaps HRO research can perform this task 
for normal accident theory. The principle specification tells how organiza- 
tional units can be simultaneously centralized and decentralized (cf Perrow 
1984:Ch. 9, 1986b:146—154). Based on HRO research, potentially flat 
hierarchies may appear to enhance reliability when: 


|. Organizations have almost unlimited control over operators who are drilled 
incessantly. As Roberts (1990:173) notes. HROs “are intermittently the 
most total of total institutions.'* Operators are more likely to assume 
responsibility when they face severe sanctions for not doing so. 


Sagan (1993), in an important book, lays heavy emphasis on this "closed system aspect of 
HROs 
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2. Operators’ cognitive premises are set and enforced with a minimum of 
outside influence. Weick (1987:124) points out that "[S]uccessful decen- 
tralization . [is] always preceded by a period of intense centralization 
where a set of core values were hammered out and socialized into people 
before the people were turned loose to go their ‘independent,’ 'auton- 
omous' ways." Only total institutions can so completely set the definition 
of "independent" and "autonomous"; competing definitions (e.g. those of 
regulators) might give rise to goal conflict and less operator accountability. 

3. Information 1s sharply focused on agreed-upon tasks that require decisions 
regarding agreed-upon actions Were any of these features absent, as 1s 
the case with many risk situations, HROs would probably be less reliable. 

4. Organizations are highly differentiated, and occupations are highly spe- 
cialized. Roberts (1990:165) again: "The deck itself 1s divided into three 
distinct areas and men are assigned to work in one area at the specialized 
jobs they have been taught." Programmed flexibility must finely delimit 
the parameters of what operators can decide. 

5. Outside scrutiny of internal operations 1s rigidly limited. Organizations 
are, among other things, arenas in which interests and passions are given 
play. This suggests that, to the extent HROs are error free, competing 
definitions of effectiveness have been eliminated. 


This restatement, however, emphasizes power and authority far more than 
is presently the case in the expanding HRO school. 


ORGANIZATIONAL RESPONSE TO ACCIDENTS 


Organizations not only create accidents, they also respond to them, thereby 
mitigating or exacerbating the consequences. This is another way organiza- 
tons can amplify risks Depending on organizations for rescue creates 
something of a paradox: organizations are built on predictability, but accidents 
by definition involve unpredictability. How then to theonze organizational 
response to accidents? 

Work on how organizations respond to accidents revolves around problems 
of planning. (An alternative metaphor, used in a small but highly visible 
research arena, 1s “organizational learning,” on which see Levitt & March 
1988 and March et al 1991.) Though organizations bear the primary 
responsibility for responding to disasters, their structures and standard 
operating procedures may thwart response efforts in important ways. Organ- 
izational procedures for responding to problems are usually based on past 
experience; organizations tend to institutionalize plans for responding to 
uncertainty when problems recur (Meyer et al 1985). This suggests that 
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organizational preparation for potential technological catastrophes 1s inher- 
ently vexing, primarily because such events are too infrequent to permit 
appreciable accumulation of knowledge about them (Stinchcombe 1990). 
From this view, an insufficient number of failures prevents organizations from 
accurately perceiving the risk of such events. In an article on Challenger, 
Starbuck & Milliken (1988:319) argue that "organizations often interpret past 
successes as evidencing their competence and the adequacy of their proce- 
dures, and so they try to lock their behaviors into existing patterns.” 

A major reason for formal organization is to simplify decisions, and the 
key simplifying mechanism is to institute routines (March & Simon 1958, 
simon 1976). These routines then become the templates through which 
organizations filter information and hence organize action. Organizations are 
organized to be inflexible. 

To take an example concerning the Exxon oil spill, Exxon and Alyeska 
were well prepared for Arctic exploration, oil management, and political 
influence. They were not as well prepared for crisis management. Or, 
considering all the organizations involved in Alaska, the risks of tanker failure 
were insufficiently covered by any organization or set of organizations. If 
organizations were infinitely flexible, they would generallv be ineffective in 
day-to-day operations. Further, even infinite flexibility would not guarantee 
effective response to off standard demands like large oil spills. Organization 
can itself become a barrier to effective response. 

Two examples make the point. There are many sensitive, and important, 
salmon hatcheries in Prince William Sound. Many of these hatcheries were 
directly in the path of the oil slick. One of the most important salmon 
hatchenes 1s called Sawmill Bay. This is one of the bays that Alyeska's 
contingency plan claimed would be boomed within six hours of a 4000 gallon 
spill. Unfortunately, for the first several days after the accident, no major 
organization moved to boom Sawmill Bay, or any other hatchery for that 
matter. The fishers of Cordova, along with other fishing villages, asked first 
Exxon, then the Alaska Department of Environmental Conservation to boom 
Sawmill Bay, to no effect In frustration, the fishers banded together, pooled 
their monies, and purchased boom from as far away as Japan and Europe, 
saving the Bay from oil contamination. Officials responsible for mitigating 
the effects of oil spills were apparently 1mmobilized by their own organiza- 
tions. The fishers, unencumbered by entrenched organization with vested 
interests in multiple goals, were able to exploit their resources to win what 
was referred to in Alaska as the Battle of Sawmull Bay. 

Shrivastava (1987:63) provides the other example. After Bhopal, while 
"the immense government bureaucracy" was just beginning to move, "the 
major portion of relief was provided by volunteers from Bhopal and other 
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parts of India—individuals, charitable institutions, social workers, and social 
clubs." The point is that, in the main, organizations do routine things well, 
and since large failures are (fortunately) uncommon, we should be surprised 
when organizations do respond well to problems like the grounding of the 
Exxon Valdez. We need more work on the possibilities of network and 
decentralized responses to disasters, even in the presence of centralized 
bureaucracies (see, for comparison, Sable & Zeitlin [1985] and Sable & 
Piore [1984]. 


CONCLUSION | 

We have slighted several risk-related topics that seem promising for further 
development, but which at present lack a sufficient body of research to warrant 
detailed review." 

Issues of social distribution of risk (by social class, sex, race, ethnicity, 
and location in the world economy) are especially important to considerations 
of fairness and trust. The popular press, some reports from regulatory 
agencies, and some scholarship have recently focused on such issues (see 
references under Fairness and Trust, above; also Bullard 1990, Bryant & 
Mohai 1992). These analyses suggest that poor people—among whom 
African-Americans are disproportionately represented in the United States— 
disproportionately bear the consequences of risk-decisions concerning toxic 
waste, chemical plants, garbage incinerators, and so on. The extent to which 
this is so is exceedingly important and is an issue with which social scientists 
and others are likely to become increasingly concerned. Distributional issues 
are important not only for policy and political reasons, but for how we theorize 
power, decision making, and institutional analyses of risk in general. 

Policy elites (e.g. Starr 1969) and some prominent social scientists (e.g. 
Wildavsky 1988) argue against focusing on technical hazards when smoking, 
diet, and other apparently controllable, individual behaviors kill, cause or 
aggravate disease, and otherwise disable thousands of people every year. The 
larger issue, ripe for research and theory, concerns the sociology of lifestyles 
and consumption. To the extent that lifestyles, and hence lifestyle risks, are 
culturally embedded and socially organized, institutions as well as individual 
choices need intellectual attention. 

Cross-cultural and cross-national work on risk is growing rapidly (see 
Krimsky & Golding, forthcoming). Culture influences decision processes, as 


P We also neglected studies of specific types of risks and uncertainty, such as those associated 
with nuclear power, radioactive waste disposal, global warming, air and water pollution, and 
environmental degradation (see, e g , Catton 1980, Freudenburg & Rosa 1984, Buttel 1987, 
Buttel & Taylor 1992, Dunlap et al, in press) 


ORGANIZATION AND RISK 395 


ethnographic, legal, and organizational scholarship demonstrate (see Douglas 
198, Jasanoff 1986). Similar organizations don't always respond similarly to 
similar problems, and decision processes vary by national and cultural context 
(see Hutter & Lloyd-Bostock 1992, Kleinhesselink & Rosa 1991, Jasper 
1990). In addition to being substantively interesting, comparative research 
helps bound theories and yields better understandings of the influences of 
legal processes, power, legitimation, rationalities, technical information, 
regulatory strategies, hands-on experience, and organizational interdepen- 
dencies. 

Sociological studies of social movements and social movement organiza- 
tions, network analysis, and interest group formation and activity are relevant 
to risk management and often to risk assessment, but have too rarely been 
applied to risk (see Douglas & Wildavsky 1982, Wildavsky & Dake 1990, 
Renn 1992). The importance of organizational structures, networks, and broad 
institutional factors such as political economies and cultural expectations 
emphasizes the need for further comparative work on these topics. 

Theories of organizational risk perception are now emerging. They con- 
centrate on how and why organizations collect, transform, and use informa- 
tion. Key concepts are concerned with how organizational interests influence 
information classification and interpretation, how information is used by 
responsible organizational elites and by contending elites within organizations, 
the degree to which technical elites and managerial elites are autonomous or 
interdependent, and the symbolic roles of technical information (Sabatier 
1978, Feldman & March 1981, Starbuck & Milliken 1988, Heimer 1985, 
1992, Weick 1987, Stinchcombe 1990, Vaughan 1992, Clarke 1993). 

The legal basis of risk management is of universal concern to the 
nation-state. Legal traditions and cultures influence the legal status of risk 
assessment as well (Jasanoff 1986). Despite its relevance and ubiquity, legal 
scholarship in the area remains relatively undeveloped (but see Short 1990, 
Felsteiner & Seigelman 1992, Hawkins 1992, Sanders 1992, Shapiro 1992). 

Communication between risk producers and potential risk bearers is 
receiving substantial political and some scholarly attention. Much of the talk 
about risk communication follows a public relations model, premised on 
convincing potential victims that their fears are unjustified. But some work 
shows meaningful, two-way risk communication is possible, with positive 
effects on community relations and risk reduction (National Research Council 
1989, Stem 1991, Chess et al, 1992, Kunreuther et al no date, Clarke & 
Freudenburg 1993). 

We conclude with the conjecture that intellectual work on risk has at least 
two, probably not mutually exclusive, futures. It could become another social 
problems area. In this future, risk studies would be like work on homelessness, 
crime, or mental illness. important in its own right, but oriented as much 
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toward alleviating injustice and effecting social change as toward conceptual 
development and social theory. Sociology of risk would then remain a distinct 
area of study, because its subject matter—catastrophe and calamity—would 
require it. Grants, publications, and policy relevant advice do not redound to 
areas of inquiry that deny their singularity. 

A second future would lead to the mainstream. To the extent that this future 
is realized, the “sociology of risk" might cease to exist. It would, rather, be 
absorbed into the catalogue of established concerns—concerns such as social 
conflict, social structure, power, social movements, and culture. A truly 
successful sociology of risk would, in both a practical and a theoretical sense, 
put itself out of business. 
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Abstract 


Though critically important for children’s long-term well being, the beginning 
school transition generally has been neglected by sociologists interested in 
issues of schooling and social inequality. This chapter examines early school 
from an educational stratification perspective. Life-stage and developmental 
considerations heighten young children’s sensitivity to school influences 
generally, while pressures associated with social role transitions (i.e. from 
“home child" to “school child") challenge them. We review how out-of-school 
social structural influences associated with poverty, ethnicity, and family type 
complicate early school adjustment. Reviewed too are various structural 
arrangements in the social organization of early schooling (e.g. access to 
preschools, the restricted socioeconomic variability of elementary schools, 
and various kinds of educational tracking") that can either reinforce or offset 
out-of-school influences. We conclude with a call for more work on 
mechanisms of educational stratification in the early grades. 


Stuck in the Margin of the Dream 


Letitia 1s “8 years old, female, and the color of cinnamon. A few weeks ago she 
completed the second grade in a public school in one of Baltimore's most 
impoverisbed neighborhoods For her good work she received a trophy inscribed 
with her name. But for a battery of tests, results were dismal. In four years 
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Letitia won't need overt discrimination to stay glued to the margins of the 
American dream." 

M. P. Fernandez-Kelly 

Baltimore Evening Sun, Baltimore 

July 12, 1991 


Although largely disguised or unnoticed, educational stratification begins in 
earnest when children start formal schooling, or just before, at roughly ages 
3 to 7. Yet for the most part the socially structured sorting mechanisms that 
help or hinder children of this age are unresearched. The purpose of this 
chapter is to point out the many ways, some hidden, in which stratification 
occurs during early schooling. 

In 1989, almost 44% of African-American parents and 36% of Hispanic 
parents, compared to 15% of white parents, fell below the poverty line (The 
State of America’s Children 1991). Children who are poor very often get low 
grades and test scores in school. They often drop out before finishing high 
school, and they repeat grades all along the line. Although proportionately 
more African-American and Hispanic children are poor, poverty is not their 
only burden. At the elementary level they are also likely to attend schools . 
with others who, like themselves, are poor and minority-group members. This 
is one of the ways "concentrated poverty" in inner city neighborhoods touches 
not just those mired in poverty, but also children struggling to survive at its 
edges (see Neckerman & Wilson 1988). 

Although national data are lacking, data for Baltimore City illustrate how 
closely school demographics reflect residential demographics, especially at 
the elementary level. In 1970 Baltimore was still a majority white city (5396), 
but between 1970 and 1990 the white population decreased by 4096 while the 
African-American population increased by about 4% (US Bureau of the 
Census 1973, 1991). As the white public school population decreased, racial 
segregation ın the public schools intensified. By 1990, in three quarters of 
the City's elementary schools, over 9096 of the students were of a single race, 
with 58% of the white students clustered in 14% of elementary schools which 
had white enrollments of 75% or more (Baltimore City Public Schools 1992). 

Baltimore's public school population is also more disadvantaged than the 
city population as a whole. Data for 1988 show that 6096 of all public school 
students were receiving free lunches (indicating low family income), a figure 
that may underrepresent the true number since not all who qualified would 
apply for the program (Citizen's Planning and Housing Association 1990). 
By comparison, 1990 Census data for Baltimore show that 31.5% of persons 
between the ages of 6 and 17 live in poverty, and 17.8% of all families are 
below the poverty line, including about 46% of the households headed by 
females with children under age 18 (US Bureau of the Census 1991). In 
elementary schools the proportion receiving free lunches ranged from a low 
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of 8% to a high of 93%, but in only a handful of schools was the proportion 
below 25% (Citizen’s Planning and Housing Association 1990). 

One consequence of the socioeconomic overlap of neighborhoods and 
elementary schools frequently goes unnoticed: in the early grades school 
segregation by socioeconomic status is pronounced. There is more social class 
variation between schools and less social class variation within schools than 
there is in the higher grades. In a random sample of Baltimore children, for 
example, Rosenberg (1979) found that 47% of 8- to 11-year-olds attended 
schools where the variation in social class within the school was less than 
half of what it was for children 15 and over in the same sample. A roughly 
similar pattern also is documented in the Coleman report (Coleman et al 1966: 
Table 3.21.2). Although the concomitants of the SES segregation of US 
elementary schools remain to be spelled out, the Plowden Report (Central 
Advisory Council for Education 1967, pp. 618—19) notes that in Britain with 
respect to teachers’ salaries, nontechnical staff and wages, proportion of 
oversize classes, and expenditures for fuel and the like, all coefficients of 
variation in primary schools exceeded the coefficients for equivalent indices 
in secondary schools. School-to-school differences in resources thus could be 
of even greater consequence in the primary grades than in the upper grade 
levels. 


“RISK FACTORS” AND THE BEGINNING SCHOOL 
TRANSITION 


Social inequality among children can be measured on several dimensions: low 
economic standing, minority group membership, residence in a single parent 
family, and so on. These facets of inequality are commonly labelled “risk 
factors” for schooling (Pallas et al 1989). A child with any of these 
characteristics risks getting low test scores, failing a grade, dropping out of 
school before finishing, or having behavior problems in school. Most children 
attending public schools now are affected by at least one such risk factor, and 
projections anticipate both absolute and proportionate increases on each well 
into the next century (Natriello et al 1990). 

Disparities in test scores and the like increase as children progress through 
school, but in some ways the strongest links between risk factors and schooling 
are at the very beginning. This is seen in patterns of preschool attendance. 
Below age 5 low income and minority children are the least likely to receive 
schooling of any kind (Select Committee on Children, Youth, and Families 
1990). In 1986, for example, only 27% of 3- and 4-year-olds from low-income 
families were enrolled in preschool compared with 42% of those from more 
affluent families. Likewise, only 29% of Hispanic children in this age group 
were enrolled in preschool compared to 38% of African-Americans and 39% 
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of whites. Head Start, the only federally funded program for poor children, 
currently serves fewer than one in five of those age 3 to 5 who live in poor 
families, a ratio that has not changed significantly since 1985. 

Early studies of educational stratification (Hauser 1970) focused on 
adolescents, the school-to-work transition, or whether students continued their 
education past high school (see reviews in Kerckhoff 1980). At issue was 
how youngsters from different family circumstances—high vs low SES, 
racial/ethnic minorities vs whites—acquired educational credentials with 
different labor market value. For this purpose it made sense to concentrate 
on the levels of schooling closest to the school-to-work transition. But 
evidence is accumulating that the search for how social structure affects 
educational stratification must begin much earlier in life. The importance of 
family background for elementary children’s school performance is firmly 
established (Marjoribanks 1979, Alwin & Thornton 1984), readiness scores 
at first grade are strongly correlated with home background (Smith 1972), 
and some recent Baltimore data (see below) show that children from 
disadvantaged families get off to an especially shaky start when making the 
first grade transition. 


The Critical Period 


Recently we (Entwisle & Alexander 1989) have argued that the transition into 
full-time schooling—entry into first grade—constitutes a “critical period" for 
children’s academic development. First, the early schooling period coincides 
with important cognitive changes as children shift from preoperational to 
operational modes of thought. Second, the elementary years, and especially 
first grade, constitute a special time for acquiring the basic skills of literacy 
and numeracy, and failing to acquire skills during this time leads to an almost 
insurmountable handicap. Topics in the early curriculum like reading and 
math are highly cumulative. Sight recognition of function words like “of” and 
“in,” a first grade priority, is important for building reading skills. Math is 
even more vertically organized—all arithmetic operations relate back to the 
simple addition and set theory taught in first grade. 

Achievement trajectories also are launched during this period, for marks, 
for standardized test performance, and also for children's experiences with 
"educational sorting." Marks may be especially important for what happens 
later because there is a social-evaluative component to them. Marks affect 
children's self-images and their test score gains (Entwisle & Alexander 1988), 
and marks respond to the child's ethnic and economic characteristics as well 
as to the child's personality or ability to "get along" (Alexander et al 19938). 
Furthermore, records of marks and other school performance data follow the 
child in an accumulating dossier from first grade upward. 

Most important for us, the beginning school period entails a transition into 
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a new social world. Major role transitions are key times for development 
because they reconfigure the individual’s social roles (Elder 1968). Children 
making the beginning school transition add “school child” to their role set 
and must learn to function in an institutional context governed by conventions 
different from those at home (see Dreeben 1968). 

This transition experience may be key for the prospects of “at risk” children. 
For children brought up outside the middle class mainstream, differences 
between home and school are dramatic. The conventions of the school, with 
its achievement orientation, its expectation that children will stay on task and 
work independently without close monitoring, its tight schedule of moving 
from lesson to lesson, its use of “network” English, its insistence on 
punctuality, and its evaluation of children in terms of what they can do instead 
of who they are, all can be daunting. One reason the period is “critical,” then, 
is that conditions outside the individual are important as well as those within 
the individual. This perspective is not commonly invoked because develop- 
mental research has focused more on changes within the individual than on 
the social contexts that shape development (Bronfenbrenner & Crouter 1983). 

Many minority and disadvantaged children cross the first-grade threshold 
lacking competencies and habits of conduct that are required by the school. 
The problem is one of mismatch (Hess & Holloway 1984) between student 
and social context, referred to more recently in terms of “capital”—human, 
social, cultural (Coleman 1988). Such challenges magnify at the point of 
school entry because this is when children are acclimating to the academic 
routine, and since the beginning-school period itself is so important to 
everything that follows, their consequences will echo for many years. Early 
school adjustment difficulties revolving around race/ethnicity and socioeco- 
nomic level are illustrated in the experience of children in our Beginning 
School Study (BSS). Baltimore has no significant Hispanic population and 
for that reason the BSS compares only African-Americans and whites. We 
think it likely, however, that many of the findings for poor African-American 
children would also characterize poor Hispanic youngsters in the early grades. 


School Performance 


Our research involves a panel of children (N = 790) who began first grade 
in the fall of 1982 in 20 Baltimore City public schools and who have been 
followed prospectively since then. The sample is stratified random, with 
predominantly white and racially integrated schools and those in middle class 
neighborhoods oversampled to enable comparisons by race and SES level. At 
the very first marking period in first grade, African-American children got 
lower marks than whites in both reading and math, and marks were 
significantly lower for poor youngsters than for those from higher income 
families. The race difference increased over the year from about .2 of a 
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marking unit on a 4.0 scale in the fall (one third of a standard deviation) to 
.3 marking units in the spring (Entwisle & Alexander 1988). These differences 
involving children’s first grade report card marks withstand multivariate 
controls. 

We also examined children’s scores on standardized tests taken soon after 
first grade began. In agreement with other reports (Ginsburg & Russell 1981, 
Tizard et al 1988), race differences at the point of school entry in BSS 
children's standardized test scores in reading and math are small or negligible. 
Differences across SES levels are much larger, though—from a third to almost 
a full standard deviation depending on the specific comparison (Entwisle & 
Alexander 1990). 

Social patterning in children's early academic difficulties is not surprising, 
but it is rare to have such comparisons for a randomly selected group of 
children right at the start of school, before they are exposed to different quality 
schools. Preschool and kindergarten attendance no doubt contribute somewhat 
to individual and group differences in first grade performance, but first grade 
contrasts of the magnitude reviewed are so large they must mainly reflect 
differences in out-of-school learning opportunities. 


Seasonal Patterns 


To follow up on the idea of extra-school learning, we compared BSS children's 
gains in math performance over the summer after first grade when they were 
out of school with their gains during the previous winter when school was in 
session. In the summer when school was closed, differences in their math 
achievement associated with family economic level were much more promi- 
nent. It turned out too that differences linked to residence in a one-parent as 
opposed to a two-parent family were negligible once economic status was 
taken into account (Entwisle & Alexander 1992). When school was in session 
in the winter, first graders from poverty level homes or those from homes 
where parents were dropouts did as well or better in terms of math gains than 
did their better-off classmates. Only in summer, when school was closed, did 
patterns reverse. Then children from more advantaged circumstances made 
great strides while those from homes where resources were lacking either did 
not gain or even regressed. Schools seem to be doing a better job than they 
are credited with. These findings of large seasonal differences in test score 
gains for Baltimore first-graders replicate findings for older children by 
Murnane (1975) in New Haven and by Heyns (1978) in Atlanta. 

Strong seasonal patterns in gains on standardized tests call into question 
the,standard assumptions as to how SES affects youngsters' school perfor- 
mance. Parents' beliefs about their own role in educating their children, with 
middle class parents seeing themselves as active partners, and working class 
parents consigning the role of educator entirely to the school (Lareau 1987), 
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may explain the strong seasonal effects. Middle class parents’ physical 
involvement in school activities, on the other hand, may have little effect on 
children’s progress because in winter the children of poorer parents progress 
as much as or more than children of better-off parents. 


School Adjustment 


Standardized achievement is only one part of a larger picture, however. 
“School adjustment” comparisons reveal that poor children’s early difficulties 
are not limited to academics. As with test scores, SES, not race/ethnicity, is 
the main divide (Alexander & Entwisle 1991). Lower SES children are much 
more often identified by their kindergarten teachers as being at risk for serious 
academic or adjustment problems; they are absent more in first grade; and 
they receive lower teacher ratings on behaviors related to school adjustment 
such as interest/participation and attention span/restlessness (the latter two 
strongly predict later academic progress; see Alexander et al 1993a). 


Other Family Factors 


Family circumstances beyond poverty and race/ethnicity very likely also affect 
early schooling because children at that point are so dependent on their 
families, not only for learning resources but even for getting to school on 
time. Most knowledge about how family configuration affects children’s 
schooling is based on studies of marital disruption (Hetherington et al 1992). 
A great many one-parent households, however, are now headed by never- 
married parents. In 1988, for example, 54% of African-American children 
and 33% of Hispanic children in single-parent households were living with a 
never-married parent, which for both groups is more children than were living 
with divorced parents (US Bureau of the Census 1989). The consequences 
for children’s schooling of growing up in the family of an unmarried parent 
have yet to be spelled out. Nevertheless, children raised in two-parent or 
one-parent families do about the same on standardized tests (Hetherington et 
al 1983, Garfinkel & McLanahan 1986), and probably this finding will extend 
to children of never-married parents as well, but to our knowledge remains 
to be verified. 

The overall picture may not be as rosy as analyses of just test scores imply, 
however. Single parents are not as effective as two parents in helping their 
youngsters comply with the student role. Children from one-parent families 
get relatively poor marks, especially in deportment (Hetherington et al 1983), 
and negative effects are stronger for those who spend their entire childhoods 
in one-parent families (Krein & Beller 1988). Teachers rate children from 
one-parent homes more negatively on homework completion (Epstein 1984) 
and as less effective in study skills (Hetherington et al 1983). Over first grade, 
improvement in conduct marks, an indication of adjustment to school, is 
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greater for children from two-parent than from one-parent families (Thompson 
et al 1992). 

Research on how family type affects children's schooling neglects several 
other key issues. One is the living arrangements of single parents. Single 
mothers often live in extended families, probably because it helps in managing 
scarce economic resources (Staples & Mirande 1980). Nevertheless, with few 
exceptions (e.g. Kellam et al 1977, Furstenberg et al 1987) single parent 
families are treated in the literature as one family type, whether the family 
lives alone or in extended arrangements. This could be misleading because 
single parents who live with other adults may have more resources to support 
children's schooling than do single parents who live alone (McLanahan 1985). 

Some differences between school outcomes for white and African-American 
children may reflect family arrangements. Among single mothers about the 
same percentage of African-Americans (41%) as whites (37%) live in 
households that contain adults beside the mother (US Bureau of the Census 
1988), but since now about 5196 of African-American families are headed by 
single mothers compared to about 1696 of white families, the likelihood is 
much greater that an African-American child will live in an extended family 
than a white child will. 

Children of mothers who live in extended arrangements are better adjusted 
to school than children of mothers who live alone (Furstenberg 1976, Kellam 
et al 1977). Also, mother's age needs to be considered in studies of how 
family type affects children's school performance. Single mothers tend to be 
younger than married mothers, and children of adolescent mothers do not do 
as well in school as children of older mothers (Hofferth 1987, Ketterlinus et 
al 1991). 

Finally, how family size affects early schooling needs study in view of 
research showing the effects of family size on reading ability (Stevenson & 
Fredman 1990) and lower high school completion rates for children coming 
from large families (Blake 1989). Some of the negative effects attributed to 
single parenting could instead reflect sibship patterns since only children are 
found more often in one-parent than two-parent families (Blake 1981, 1989). 

Why children of single mothers do not do as well in school as children 
from two-parent families is not easily explained, but poverty is probably a 
major cause (Garfinkel & McLanahan 1986). The poverty rate for children 
in female-headed households—55% in 1987— is more than double that of all 
children (Table A17, National Black Child Development Institute 1990), and 
such poverty often is long term (Duncan & Rodgers 1988), which very likely 
compounds problems associated with shortfalls in resources. 

Not all of the literature cited above focuses on the beginning school 
transition specifically. Nevertheless, there is firm evidence that children's 
adjustment difficulties during this period are socially patterned, especially by 
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family economic status but also by ethnicity and family type. The question, 
then, is, do these early difficulties have lasting consequences, and does the 
organization of schooling magnify or moderate them? Evidence on the lasting 
effects of early education for later schooling outcomes is reviewed in the next 
section; the question of institutional influences that overlap the beginning 
school transition then is addressed. 


LASTING EFFECTS OF EARLY SCHOOLING 


Reasons for thinking that children should be especially responsive to the 
experience of schooling at the point of entry are reviewed in detail elsewhere 
(Entwisle & Alexander 1989). For one thing, rates of learning during first 
grade, as reflected in gains on standardized tests, are ten times as great as 
they are in high school (Jencks 1985). For another, children in the primary 
grades are optimistic, eager to learn, and tend to form emotional attachments 
to their teachers. All these factors should heighten their receptiveness to school 
influence and are notably lacking in the upper grades. (The downward track 
in children’s academic self-esteem and liking for school is reviewed in Eccles 
& Midgley 1990.) 


Studies of Preschool Effectiveness 


Some of the best evidence on lasting effects of early schooling comes from 
evaluations of preschool. Lazar & Darlington (1982) reassessed the data from 
seven Head Start experiments carried out in the 1960s in which students were 
randomly assigned to experimental (preschool) and control (no school) groups. 
It turned out that the IQ gains from the preschool programs were small and 
transient, in line with earlier appraisals (Jensen 1969, McDill et al 1969, 
Bronfenbrenner 1974), and in line with assessments of other early interven- 
tions such as increasing kindergarten attendance from half to full days (Karweit 
1989, Entwisle et al 1987). But achievement test scores held up better over 
the long run, and both grade repetition (i.e. retention) and Special Education 
placement were significantly reduced for preschooled children. 

These positive findings for preschooling have not received the attention 
they deserve. Even though the short-term effects of raising IQ scores faded 
out, such short-term boosts in performance may still confer long-term benefits. 
Woodhead (1988, pp. 447-48) suggests how the short-term benefits could be 
converted into longer term gains. “If during the early grades these children 
were seen as smarter and more adjusted to the demands of the school, this 
could have been sufficient to trigger a more positive cycle of achievement 
and expectation .. [and] carry the child through the later grades long after the 
original cognitive [[Q] benefits had washed out.” In other words, the school’s 
response to the preschooled children while their IQ’s were temporarily 
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elevated could produce lasting benefits. Children may be easier for the first 
grade teacher to teach, for example, if their cognitive status is temporarily 
improved. Or the preschooled children’s parents may have become more 
favorably impressed with their children’s abilities (some analyses suggest 
this). Retention rates generally are highest in the first grade (Shepard & Smith 
1989), so only moderately better achievement in the early grades might be 
enough to avert retention or placement in Special Education. 

The Consortium report (1983: p. 43 ff) has information about even longer 
term effects from three programs for 192 youngsters who were no longer in 
high school at the time of the evaluation. Attendance at a preschool program 
had a direct positive effect on the likelihood of high school graduation. 
Additionally, 66% of students who had graduated from high school and had 
no retentions were employed compared to 41% of the others. These same 
youngsters also did better in adapting to “mainstream society.” Criteria for 
“adaptation” were (i) the student’s being in any type of educational program 
including high school, or (i1) being in the military, or (i1) being employed 
or temporarily laid off, or (iv) living with a working spouse/companion. Poor 
adaptation was indicated by not being employed or looking for work, being 
in prison, or being a nonstudent on public assistance. Many of these positive 
findings come from the Perry Preschool Project (Berrueta-Clement et al 1984), 
one of the experiments with the longest time frame and a particularly intensive 
intervention. 

Four explanations proposed to account for preschool effects—earlier is 
better, more parental involvement is better, highly structured is better, and 
longer, more intense is better— now are seen as too simplistic (Castro & 
Mastropieri 1986, Karweit 1989, White et al 1992). Notwithstanding, the 
conclusion seems secure that what happens to children during this critical 
period can alter their educational and later life trajectories in fundamental 
ways. (For overview see Haskins 1989, also chapters in Slavin et al 1989 and 
Slavin 1989.) 


Research on High School Drepout 


The preschool effectiveness studies look forward to see what ensues across a 
range of outcomes; dropout studies look backward to see what anticipates 
school discontinuation. Studies that look back far enough invariably locate 
precursors to dropout in children’s early school experience, and one such 
precursor is grade retention (Rumberger 1987, Wagenaar 1987). The timing 
of retention is not always defined in this research, so the link to early schooling 
must be made with care. Clearly, though, retention before high school is 
important (Fine 1991), and studies that have examined the connection between 
retention in the early grades and later dropout find strong ties, even with 
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measures of academic performance controlled (Stroup & Robins 1972, Lloyd 
1978, Cairns et al 1989). 

Dropout is not an isolated event that occurs in high school. Rather, it is 
the culmination of a process of disengagement from school that stretches back 
over many years (Astone & McLanaban 1991, Finn 1989), very likely back 
to children's earliest encounters with the school. But not just formal 
interventions like retention carry forward from the early elementary grades to 
dropout. Dropping out also links to absenteeism 1n the primary grades, to a 
history of poor academic performance in the early years, and to children's 
behavioral style, especially aggressiveness (Ensminger & Slusarcick 1992, 
Kaplan & Luck 1977). 


Lasting Interpersonal Influences 


Other later outcomes respond to early interpersonal influences like parents’ 
and teachers’ expectations. Early studies in the “Pygmalion” tradition 
(Rosenthal & Jacobson 1968) have overrated the potency of teacher expec- 
tancies, especially as the source of minority and disadvantaged children's 
academic difficulties (see Brophy 1983, Wineburg 1987 for review and 
commentary). Nevertheless, research is accumulating that teachers' (and 
parents") ideas about children's competencies are important. Teachers' ideas 
condition classroom dynamics in complex ways (e.g. Alexander et al 1987), 
and both parents' and teachers' influences in the early grades can be long 
lasting. For example, among middle class children, Entwisle & Hayduk (1988) 
found that parents' estimates of children's ability in third grade were a strong 
predictor of youngsters’ English/reading scores four to nine years later 
(explained variance in the later scores increased by over 1696 even with later 
cognitive ability controlled). For working class children, teachers' expecta- 
tions measured in grades one and two significantly predicted both English and 
math scores four to nine years later, again with current cognitive ability 
controlled. Likewise Stevenson & Newman (1986) found that Minneapolis 
children's grade 10 test scores in reading and math were significantly affected 
by teachers’ ratings in grades 2 and 5, and Hess et al (1984) found that 
maternal expectations for the child's achievement in preschool predicted 
achievement in grade 6. 

The precise mechanisms involved in explaining these long-term effects are 
debatable, but with later cognitive ability controlled, persistence in cognitive 
traits cannot be the answer. What seems most likely is that favorable 
expectations of significant others prompt superior performance in some 
students in elementary school (e.g. Rosenthal & Jacobson 1968). Children 
for whom teachers hold high expectations are held to stricter standards, called 
upon more, and are more often pressed for answers (Brophy & Good 1974). 
The favorable expectations could promote more learning in the early grades 
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and help in establishing high achievement levels. In other words, parents or 
teachers helped some of these children cope better during the critical period. 


Continuity in Achievement Patterns 


Finally, the high over-time stability in achievement patterns (Husen 1969, 
Bloom 1964) also very likely implicates early school experiences, although 
the specific causal chains need to be traced out, and research at this level of 
detail is just beginning (Alexander & Entwisle 1988, Reynolds 1989, 1991). 
Clearly, though, doing better during this period is important because children 
are launched into achievement trajectories that then persist. 

By the end of third grade most children’s achievement levels are fairly 
stable, and the quality of their performance by that age is usually a good 
indicator of long-term school performance. In Husen’s (1969) large cross-na- 
tional study, for example, both intelligence scores and teachers’ ratings in 
third grade were strong predictors of children’s long-term educational careers. 
Also, Kraus (1973), who followed children in New York City for over 20 
years, found that the most significant predictor of adult status was the score 
obtained on third grade reading achievement tests. High year-to-year stability 
in children's relative achievement rankings also is evident in the achievement 
trajectories of youngsters in the Beginning School Study. Those who tested 
at the lowest level in the group at the beginning of first grade continue to 
occupy low positions for the next eight years, as far as they have been traced 
to date (Alexander et al 1993b). 


INFLUENCES DURING THE BEGINNING SCHOOL 
PERIOD 


What processes underlie these stability effects? As a child starts first grade a 
number of institutional or bureaucratic forces are put into action, and these 
very likely are relevant. As mentioned, at that point a cumulative written 
dossier begins to be compiled, and it follows the student year after year. By 
this means, the reputations children earn at the beginning of school may help 
or hinder them for many years to come. Another institutional force relates to 
elementary teachers' own social origins (Alexander et al 1987). Low status 
and minority pupils experience difficulty in tbe classes of high status teachers. 
They are evaluated by these teachers as less mature, held to lower expecta- 
tions, and their year-end test scores and marks are lower. 

Other institutional forces, more impersonal, revolve around early educa- 
tional sorting and are analogous to high school tracking. The next section 
discusses three such mechanisms for "sorting and selecting" children that are 
of particular relevance to the beginning school transition: within-class ability 
grouping, Special Education, and retention. 
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Within-Class Grouping 

Administrative sorting arrangements in elementary schools are effectively 
“tracks” but are not usually considered under this rubric. Within-class ability 
grouping is one of the few aspects of early schooling that sociologists of 
education have studied at all extensively. Sorensen (1970) has referred to such 
arrangements as vehicles for the “organizational differentiation” of students. 
In the primary grades, grouping has many variants (McPartland et al 1987, 
Slavin 1987), but whatever the nominal basis for the distinctions drawn— 
readiness levels, competencies, students’ needs or interests, students’ likely 
futures—such arrangements are all prompted by a similar logic: to reduce 
heterogeneity in the instructional group. Although teachers at times have been 
suspected of bias in assigning poor children to low ability groups, quantitative 
studies find no direct effects of race or social class on assignment to ability 
groups at the elementary level (see Oakes et al 1992). 

In first grade more than 90% of elementary schools use within-class ability 
grouping for reading, so the practice is near universal (in contrast, about 25% 
of schools group for math in first grade—McPartland et al 1987). Three or 
four groups are the most typical arrangements, but as few as two groups and 
as many as five groups sometimes are used (Hallinan & Sorensen 1983). 
Additionally, group size in elementary schools is not related to group level, 
so low groups are not smaller, as might be expected (Hallinan & Sorensen 
1983). Given these figures, in first grade probably a third to a fourth of 
children experience bottom group placement in a given subject. 

Slavin (1987) concludes that within-class ability grouping can be beneficial, 
even for low group children, when it creates homogeneity on the skills being 
taught, when it is done flexibly, and when instruction is closely tailored to 
children's competence. Kulik & Kulik (1987). reviewing much the same 
literature as Slavin, find weaker though still positive support for within-class 
grouping, but they do not agree with Slavin about the conditions enhancing 
effectiveness. Reviews of the literature on grouping are mixed, which 
Archambault (1989) suggests may have to do with whether grouping in math 
or in reading is emphasized. 

Most commentators conclude that grouping disadvantages children in low 
groups and increases educational inequality (e.g. Rosenbaum 1980, 1984, 
Good & Marshall 1984, and other references cited in this section). Why 
grouping has such effects is not well understood, but placement in groups 
often determines the amount and type of instruction children receive (Barr & 
Dreeben 1983, Bossert et al 1984), and group process—interruptions and 
disruptions—tends to vary as a function of the group's composition (Eder 
1981, Felmlee & Eder 1983). An excellent overview of the literature on 
grouping and tracking through high school is available in Oakes et al 1992. 
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Gamoran (1986) examined effects of first grade reading group rank on 
number of words learned and found that effects of group rank were almost 
completely explained by group variations in number of words taught. But 
reading is a complex skill and hard to reduce to a single indicator. For 
example, Collins (1986) shows that children’s learning styles vary by ability 
group even when they are reading from the same basal reader. Children in 
low groups read word-by-word with teachers providing isolated decoding 
clues. This instructional style does not give beginning readers much chance 
to apply their knowledge of spoken language. In high groups, clauses, 
expressive intonation, and supposed emotional states of speakers are brought 
to children’s attention. This kind of instructional emphasis in high and low 
ability groups is confirmed in other reports (Alpert 1975). 

Reuman (1989) shows that reading group rank determines the ranges within 
which teachers assign marks. The children in low-ability groups get lower 
marks than children of the same tested ability who are in low-ability 
classrooms. Placements for reading are only loosely tied to test scores, so 
many children can be affected by this. Also, Pallas et al (1992) find that 
children’s first grade reading group assignments have effects (at least through 
the fourth grade) on standardized achievement scores in reading and math and 
on parents’ expectations for the child’s future performance (net of a set of 
control variables including race, socioeconomic status, sex, family type, 
parent expectation level, marks, and beginning-school test scores). Such 
influences on parents’ perceptions of children years later suggest that 
consequences, including symbolic consequences, of instructional grouping 
may be more substantial and widespread than is generally appreciated (e.g. 
Gamoran 1986). 

From the vantage point of this chapter, instructional grouping is just one 
component of an elaborate apparatus for “educational sorting” that confronts 
children as they begin their school careers. Grade retention (Shepard & Smith 
1989) and Special Education placement (Carrier 1986) are additional compo- 
nents. 


Grade Retention 


With national data on retention not available, a broad picture must be 
constructed by using data on enrollments below the modal grade for a child’s 
age group. Census data (1992) for 1990 show continuation of a trend, reaching 
back to 1980 and before, toward increasing proportions of below-modal 
enrollments among elementary youngsters. Overall, about 20% of 6- to 8-year 
olds and just under 30% of 9- to 11-year olds were enrolled below mode. 
Percentages are higher for boys than for girls and for minority youngsters 
than for whites. Analyzing enrollment data from 1980, Bianchi (1984) found 
“below modal” enrollment escalates rapidly as risk factors cumulate. For 
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example, about 50% of the males and 40% of the females in poverty level 
households where the head is a high school dropout were below modal grade 
for age. 

These figures are in line with retrospective reports for the NELS-88 eighth 
graders and from scattered local tallies. In the NELS-88 data (National Center 
for Education Statistics 1990), over 31% of children in the lowest quartile on 
an SES composite scale reported having been held back versus about 8% of 
the children in the highest quartile. For African-American and white young- 
sters percentages were 16% and 6%, respectively. 

Retention data compiled by Shepard & Smith (1989) for 14 states for 
1985-1986 show retention rates to be consistently the highest for first grade, 
usually by a considerable margin. First grade retention averaged just over 
11% for 13 states for which data were available, while the highest rate for 
grades two through six was a little over 8%. The high rate for retention in 
first grade is seen in Baltimore children as well. Over 16% were held back 
at the end of first grade, more than twice the rate as for any later year through 
year eight (Alexander et al 1993b). 

In Shepard & Smith's survey, grade specific retention rates in the 796—896 
range were common. These are consistent with cumulative retention rates on 
the order of 5096 in many areas of the country. This level of retention probably 
is typical for public school systems like Baltimore's, where by the eighth year 
in school almost exactly half the BSS students remaining in City schools had 
been held back at least once (Alexander et al 1993b, see also Fine 1991). 


Special Education 


special Education placement is less common but still far from rare. Since 
passage of Public Law 94-142 in 1975, practices in Special Education have 
changed (Heller et al 1982). From the mid-seventies to the mid-eighties the 
number of children of all ages receiving Special Education services went up 
substantially, from 3.7 to 4.3 million, or from about 896 of all 3- to 
2l-year-olds to just under 11%. The most recent available tally shows 
continuing increases, with the total in 1990—1991 reaching 4.8 million 
(Viadero 1992). The biggest increase was registered for children with mild 
learning handicaps: between 1976-1977 and 1985-1986 the percentage 
classified as Learning Disabled jumped from 1.8% to 4.7% of the total (Singer 
& Butler 1987). 

The criteria for placement in Special Education vary widely from place to 
place, and both the validity and reliability of classification procedures have 
been questioned. Learning disabilities are indicated by a marked discrepancy 
between ability and performance, but often the category is used as a catch-all 
(Singer et al 1989). Although practices vary, most children remain in Special 
Education once they have been designated for such services (Walker et al 
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1988, Edgar et al 1988). Most Special Education children receive pull-out 
services from regular classes (68% nationally in 1985-1986). 


EFFECTS OF EARLY TRACKING 


Administrative tracking of one kind or another—ability grouping, retention, 
Special Education—thus touches a great many young children, and for many 
it begins when their school careers begin. As noted, retention is much more 
common in first grade than later, and 1t is our impression that 1f retention is 
not effective in getting children up to satisfactory performance levels, Special 
Education often 1s the next step taken. Despite widespread concern that poor 
or minority youngsters are unfairly singled out in these ways, evidence in 
support of such inequities is not compelling. SES and racial/ethnic bias in 
ability group placements in the primary grades rarely show up 1n studies that 
pursue the matter systematically (Haller 1985, Haller & Davis 1980, Sorensen 
& Hallinan 1984). Although low SES and minority youngsters have higher 
rates of retention (Bianchi 1984) and are overrepresented in some Special 
Education categories and in some localities (e.g. Singer et al 1989, Finn 
1982), at least racial/ethnic patterns can generally be explained by other 
factors—SES differences and differences between groups in tested competen- 
cies (Dauber et al 1993, Heller et al 1982, Travers 1982). 

Many questions remain about all three practices, but skepticism about their 
effectiveness and concerns about their possible stigmatizing consequences are 
widespread (on grouping, see references already cited; on Special Education, 
see Heller 1982, Leinhardt & Pallay 1982, Madden & Slavin 1983; on 
retention, see Jackson 1975, Holmes 1989, Harvard Education Letter 1991, 
but also Karweit 1992 for a somewhat more positive assessment). There are 
many evaluations of each, but almost all studies examine just one of these 
practices in isolation from the others. This piecemeal approach may be a 
serious mistake, though. All are mechanisms of educational sorting, and so 
there are important conceptual parallels that warrant attention. There also is 
considerable overlap in children's placements, so they often are joined at the 
level of experience too. Many children are not just repeaters, or in Special 
Education, or in low reading groups. Rather, some children carry all these 
identities, and others carry two or more. The patterning of children's 
placements in multitrack systems has been almost completely ignored. 

In the Beginning School Study (Alexander et al 1993b), there are 585 
youngsters for whom first grade fall and spring reading group placements, 
retention status, and Special Education placements all are known. Sixty-five 
percent (or 382 children) had no such placements, so the majority were spared 
all these in first grade. In the fall, however, 28% were in the lowest reading 
group (2296 in the spring), 1496 were held back at year's end, and 7.496 were 
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designated for Special Education services. Of the 203 youngsters touched by 
these experiences, just 3596 had only one low placement; 82, or 40% of the 
total, had multiple low placements (65% if fall and spring groups are counted 
separately), with twelve different track placement configurations being 
represented in these children's experience. 

Studies that limit themselves to one dimension of early tracking risk 
misconstruing the source of children's difficulties. For example, 85% of the 
first grade repeaters also were in low reading groups or were receiving Special 
Education services. Such studies also risk underestimating effects of tracking, 
as there is virtually no research on how consequences compound across 
dimensions of tracking. 

The articulation of tracking from one level of schooling to the next, 
especially starting in the primary grades, also has yet to be studied. In contrast 
to most commentary (e.g. Jackson 1975, Holmes 1989, Harvard Education 
Letter 1991), our research on retention over the elementary and middle school 
years suggests largely positive consequences for single-year repeaters in terms 
of tests scores, marks, and attitudes toward self and school. These positive 
consequences were revealed by comparing children's standing before retention 
against their standing after retention in a prospective evaluation, which is rare 
(Alexander et al 1993b). However, it also reveals strong connections across 
Jevels of schooling in children's placements, and the implications of this have 
yet to be worked out. 

Much remains to be learned about how early tracking affects children during 
the transition into full-time school. Such inquiries must consider the many 
guises in which tracking presents itself during this period. Most studies that 
touch upon children's academic or socioemotional development in the early 
grades are targeted on grade 3 and above, and even fewer are sensitive to 
"settling in" and "transition" issues [exceptions would be Entwisle & Hayduk's 
(1978, 1982) earlier studies in Baltimore; the ongoing BSS project; Reynold's 
work (1989, 1991, 1992); and probably Barr & Dreeben's (1983) and Kellam 
et al's (1977) research, which are set in first grade, but do not develop the 
beginning school transition as a theme]. 


OVERVIEW 


This chapter shows that educational stratification begins in earnest when 
children start their formal schooling or just before, at roughly ages 3 to 7. 
This 1s a "critical period" in their lives, a special time, when for developmen- 
tal/cognitive and social/contextual reasons they are especially responsive to 
schooling. In these years they are launched into achievement trajectories, 
construct their academic self-images, and establish themselves as “paper 
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personas” with school dossiers that then shadow them up through high school 
and beyond. 

Mounting evidence testifies to the long-lasting and powerful effects of 
schooling during this critical period for children’s life chances and ultimate 
well-being. Still, except for within-class ability grouping, sociologists have 
paid little attention to the social patterning that effectively tracks students over 
this early period: (i) access to economic resources and/or preschools for 3 to 
5 year olds; (ii) the restricted socioeconomic variability in elementary schools 
that matches their neighborhood catchment areas; (iii) the administrative 
sorting of first and second graders—grade retention, ability grouping, and 
Special Education. 

Of three major aspects of social structure that have known consequences 
for early schooling—poverty, ethnicity, and family type—poverty has by far 
the strongest effects. In fact, most of the differences that emerge in young 
children's standardized test scores associated with ethnicity or family type 
can probably be traced to economic disadvantage that takes its toll in summer 
periods when school is closed and the resources needed to support learning 
are not replaced by the home. Differences in other school outcomes like marks 
and deportment, which depend on teachers' judgements, follow lines of 
ethnicity and family type, and affect test scores indirectly. 

Stratification research has neglected socially structured sorting mechanisms 
that affect children around the time of the beginning school transition, and 
its later articulations with stratification in middle and high schools as well. 
We believe that issues of early schooling should have a higher priority among 
sociologists interested in schooling and social inequality. We hope that the 
case made here will be a catalyst for such inquiry. 
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Abstract 


It is common to all types of longitudinal data that one observes some social 
unit from at least two points in time. With such data several opportunities for 
analysis arise that are not present in cross-sectional data, for example, to study 
change processes and to account for unobserved variables in a more robust 
manner. There are many types of longitudinal data. I review some recent 
advances in analyzing two types: event histories and panel data. In Part I of 
this article I focus on seven recent advances in analyzing event history data: 
(i) techniques for dealing with unobserved explanatory variables, (ii) pecu- 
liarities of various sampling frames, (iii) time-aggregation bias, (iv) discrete- 
time methods, (v) estimation procedures such as those based on Cox’s partial 
likelihood, (vi) local hazard-rate models, and (vii) continuous state space 
models. In Part II of the article I focus on recent advances in analyzing panel 
data, with an emphasis on how panel data allows one better to take account 
of unobserved variables in the types of static relationships usually estimated 
with cross-sectional data. 


1. INTRODUCTION 


Longitudinal data come in a variety of forms. Common to all types is that 
one observes some social unit at at least two points in time. With such data 
severa] opportunities for analysis arise that are not present in cross-sectional 
data, for example, to study change processes and to account for unobserved 
variables in a more robust manner. There are many types of longitudinal data. 
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I shall review some recent advances in analyzing two types: event histories 
and panel data. 

The first type discussed, event histories, are generated by so-called 
failure-time processes, and they take this form. The dependent variable—for 
example, some social state—is discrete or continuous. Over time it evolves 
as follows. For finite periods of time (from one calendar date to another) it 
stays constant at a given value. At a later date, which is a random variable, 
the dependent variable jumps to a new value. The process evolves in this 
manner from the calendar date—when one change occurs—to a later date 
when another change occurs. Between the dates of the changes, the dependent 
variable stays constant. 

Data on such processes typically contain information about the date a sample 
member entered a social state, for example, an employment state, the date 
the person was last observed in the state, and, if the state was left at that date, 
the value of the next state entered, and so on. 

In analyzing such data the foci are on what determines the amount of time 
spent in each state and on what determines the value of the next state entered. 
Typically, one would like to assess the effects of covariates on the amount 
of time spent in a state and on the value of the next state entered. 

I focus on several recent advances in analyzing event history data: (i) 
techniques for dealing with unobserved explanatory variables, (i1) peculiarities 
of various sampling frames, (iii) time-aggregation bias, (iv) discrete-time 
methods, (v) estimation procedures such as those based on Cox’s partial 
likelihood, (vi) local hazard-rate models, and (vii) continuous state space 
models. 

The second type of data considered, panel data, arises from a variety of 
processes, including failure-time processes. The central feature of panel data 
is that one records at regular intervals the state each individual in the panel 
occupies at each time an observation is taken. Usually the observation intervals 
are equi-spaced, so that one, for example, knows the states occupied during 
the first week of January each year. In such data one typically knows only 
whether a change in states has occurred between the dates of the panel, but 
not when. I review techniques for dealing with unobserved explanatory 
variables in static models for analyzing panel data. This is where panel data 
analysis has made its most impressive advances. For reasons give in the section 
on panel data, I believe that dynamic models for panel data have been less 
compelling, except, perhaps, in the case of studying diffusion processes (see, 
e.g., Tuma & Hannan 1984, Part III). 

Before proceeding to these tasks, I briefly discuss how this review updates 
the previous review of longitudinal methodology in the Annual Review of 
Sociology (Hannan & Tuma 1979). If the 1970s was the decade for progress 
in confirmatory factor analysis and loglmear models one might say that the 
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1980s was the decade for progress in methods for analyzing longitudinal data. 
Hannan & Tuma's (1979) review focused on event history analysis, analysis 
of panel data, and time series analysis. This review updates theirs in the two 
first areas. For event histories, the specific focus is on the seven topics listed 
above, topics that were mostly in their infancy or not introduced at all at the 
time of the last review. For panel data, I focus only on static models, 
specifically on procedures for taking account of unobserved variables, 
procedures that developed and diffused rapidly in the early 1980s. These were 
only briefly discussed in the last review (Hannan & Tuma 1979, p. 322), 
though many of the results from analyzing continuous outcomes in linear 
models were known at that time. The corresponding results for categorical 
dependent variables were developed in the early 1980s. 

Needless to say, many advances in the fields of event history and panel 
data analyses are not covered here. But more conspicuously, there are types 
of longitudinal data not discussed at all, such as time series and event count 
data, mainly because I lack competence in discussing these topics. Models 
for analyzing many of the other types of data not discussed here can be found 
in von Eye (1990). The two types of data discussed do however cover a large 
spectrum of the kinds of longitudinal data used in sociology. 


PART I: ANALYSIS OF EVENT HISTORIES 


2. The Hazard-Rate Framework 


Event history data are usually analyzed by means of hazard-rate models. I 
first define the basic terms in hazard-rate models. Then I go on to discuss 
recent advances in analyzing such data. 

Let T be a random variable denoting the amount of time spent in some 
social state and let 5, be a specific realization of T. Let Z be a random 
variable denoting the state entered when a transition occurs, and let z denote 
a specific realization of Z, where Z is categorical, that is, has a finite 
number of values. Let x denote a vector of explanatory variables, including 
the constant 1. 

The hazard-rate framework proceeds by dividing the duration interval 0 to 
t, into a set of smaller time intervals, in fact, into a set of infinitely many 
small time intervals, each of infinitely small length (e.g. Petersen 1991a). 
For each of these time intervals, one then specifies the probability that the 
focal state in question is left, and if left, the probability that a specific new 
state, say, z, was entered. In specifying this probability, one conditions on 
the fact that the state was not left prior to entry into the interval in question. 
The central concept is the destination-specific rate of transition, A (t | z,,x), 
where z, denotes the state occupied immediately prior to t. It is defined as 
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1,@t | z,,x) 2 limP(tsT «t +At, Z = z | T =t,z,x) At. 
Ar} 0 (2.1) 


The destination specific rate of transition gives the rate at which a transition 
to state z occurs at duration f, conditional on no transition prior to t, that state 
z, was occupied immediately prior to t, and on the explantory variables x. For 
z = Z, the rate is zero. When the rate is small, say, .05 or less, it can for all 
practical purposes be interpreted as the probability of transition in the next 
time unit (see Petersen 1991a, Sect. 3). 

Let z' denote the number of possible destination states. The overall rate of 
transition at duration 1, irrespective of the destination state, follows by 
straightforward probability calculus as 


t 


Mt | 209 =È NC | z2), 
ved (2.2) 


because the z' states are mutually exclusive. 

Let P(Z = z|T = t,z,,x) denote the probability that state z was entered, 
given that a transition occurred at t, given that state z, was occupied before 
the transition, and given x. It is defined as 


P(Z =z | I1—62,x) ^hmP(Z =z | t €ET«t +At,z,x). 
Ar Jo (2.3) 


Since the states are mutually exclusive and exhaust the possible transitions, 
we get 


» P =z | TI—h5zX)-—L. 
z, (2.4) 


Now, using probability calculus, one can decompose the destination-spe- 
cific rate of transition as 


A, | 2,X) = A(t | zx) X PIZ = zT = t,z,,X], (2.5) 


that is, into the overall rate of transition times the probability of the destination 
state, given that a transition occurred. 

The survivor function, giving the probability that no event occurs before 
t,, follows as 
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P(T =4 | 2,3) =expl- L > Als | zs]. za 
z=] . 


Finally, let 
1 if a transition occurs in the time interval t to t -- Át, 
D(t4- A) (2.7) 
O if no transition occurs in the time interval t to t - At, 


From (2.7), we get the probability of a transition between t and t + At, given 
no transition prior to t and given z, and x, as 


P[D(t + Ar) = IT = t, zx] = Malz, x)At, for small At, (2.8) 


which can be made exact by replacing Ar on the right-hand side with Ar’, 
where Ar* < At (e.g., Petersen 1991a, Sect. 6). 

If, at most, one change in D can occur between t and t + At, it follows 
that 


D(t + An = X(t| Z,X)À + e(t), when T = t and Ar is small, (2.9) 


where e(t) is a stochastic error term with expectation 0, conditional on T = 
t, Z, and x. 

Equation (2.9) shows how the dependent variable in event history analysis, 
D(t + At), assuming that there is only one state to which transitions can be 
made, can be expressed as the sum of its mean plus an error term. For each 
individual, there are as many such 0-1 dependent variables as there are time 
units during which the individual is observed. In each interval where no 
transition occurs, this dependent variable is zero. In the interval where a 
transition occurs, the dependent variable equals 1. When Az | 0, there will 


be infinitely many such intervals for which one can specify (2.9). 


3. Unobserved Heterogeneity 


Unobserved heterogeneity refers to the situation where some of the covariates 
on which the hazard rate depends are not observed. The issue is distinct from 
the existence of an error term in hazard-rate models. As discussed in the 
section on The Hazard-Rate Framework, the dependent variable in analysis 
of event histories can always be expressed as a sum of its mean and an error 
term. 
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A(tlz,x, p.) = exp(Bx + dp), (3.1) 


where ja is the unobserved variable and 5 its effect; u may for example capture 
whether a person is a mover or a Stayer (see Spilerman 1972). 

Continuing the example of equation (2.9) in the previous section, the value 
of the dependent 0-1 variable D(t + At) corresponding to (3.1) can be written 
as 


D(t + At) = exp(Bx + 5y)At + e(t), when T = t and At is small, (3.2) 


where e(t) is an error term with mean of zero, conditional on z,, x, w, and T 
= ft. Equation (3.2) shows that p is distinct from an error term e(t). 

The unmeasured variable ju presents a problems in that even if its mean, 
conditional on z and x, is zero [i.e. E(w|z,,x) = 0], the mean of exp(5p), 
conditional on z, and x, will typically be different from zero (by Jensen's 
inequality, see e.g. DeGroot 1970, p. 97). For example, if u is gamma 
distributed with mean 1 and variance 1/c and we normalize 8 — 1, then the 
average rate, unconditional on p, for those who had not experienced an event 
by duration t becomes 


A(tiz,x) = o + exp(Bxy[t - exp(Bx) + o], (3.3) 


which depends on : (see e.g. Tuma & Hannan 1984, p. 179). The rate in 
(3.1) is independent of t, but when u is not observed, the rate in (3.3), 
unconditional on p, ends up depending on t. 

Therefore, when an unobserved variable enters the hazard rate, as in (3.1), 
it typically does not cancel out when it comes to estimation, as the error term 
e(t) in (3.2) does. To avoid biases in the estimates, one must take account of 
the unobservable in the estimation procedure. 

The most well-known problem arising from not controlling for unobserved 
variables is that the estimated hazard rate becomes biased toward negative 
duration dependence (see, e.g., Heckman & Singer 1984a, pp. 77—78). 
Intuitively, this occurs because individuals with high death rates on the average 
depart from the population at risk before those with low death rates, so that 
the rate in the population, not controlling for the individual frailty levels, 
declines over time. That this is the case can easily be seen from the rate in 
(3.3). By taking the derivative of (3.3) with respect to t one sees that the 
hazard rate declines with f. 

Several solutions to the problem of unobserved heterogeneity have been 
proposed, but two are prominent in the literature (see Yamaguchi 1986). 
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The first solution assumes, as above, that p is a random variable, commonly 
referred to as the random-effects procedure. The main approach in that case 
is to impose a specific distribution on p, say, normal, lognormal, or gamma. 
In terms of estimation, one first derives the likelihood for the observed history 
on an individual, conditional on observed and unobserved variables. There- 
after, one uses the imposed distribution of the unobservable to compute the 
mean of the likelihood when the unobserved variable is not taken into account. 
That is, one computes the average value of the likelihood, where the averaging 
is done over all possible values of the unobserved variable. This average 
depends on the observed variables, on the parameters of the hazard rate, and 
on the parameters of the distribution of the unobservable. This procedure is 
repeated for each individual in the sample. Thereafter, the likelihood of the 
sample is maximized by standard procedures (see Lancaster 1979). 

Neither theory nor data give much guidance when choosing the distribution 
for p. Heckman & Singer (1984b) therefore developed an estimator of the 
hazard rate that is nonparametric with respect io the distribution of u. They 
also suggested the use of the so-called EM alogrithm for computing the 
estimates (see Dempster et al 1977). In this formulation one specifies a 
multinomial distribution for ju. One then ends up estimating, besides the effect 
parameters of the measured variables and the effect of duration in state, three 
additional quantities, namely: the number of categories in the multinomial 
distribution assumed for p., the probability distribution for the categories, and 
the effect of each category on the rate. An example of this, in the case of a 
known number of categories, a binomial distribution, is given in (3.4) below, 
a formulation that has been known for some time (e.g. Nickell 1979). There 
is not much experience with the estimator suggested by Heckman & Singer 
(1984b), but conceptually it is highly appealing. Trussel & Richards (1985) 
report that the estimator is quite sensitive to the specification of the duration 
effect. Hoem (1990) reports that it is bighly sensitive to the number and types 
of explanatory variables included. Yet, it is probably too early to say what 
its ultimate value to the practitioner is, although the limited experience with 
it in the sociology literature is discouraging. Several alternative nonparametric 
approaches are currently being developed (e.g., Honoré 1990). 

The second main solution is to treat p as a fixed variable (Chamberlain 
1985), referred to as the fixed-effects procedure. While appealing in that few 
assumptions need to be imposed on p, this procedure has the drawback that 
it applies only to processes where the event is repeated over time and where 
one has observed at least two transitions on some of the individuals in the 
sample. Furthermore, and perhaps more restrictive, only the effects of 
covariates that change over time can be estimated. For example, one cannot 
estimate the effects of sex and race. 

It is instructive to consider how the random-effects procedure works. I 
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conduct a Monte-Carlo experiment. Suppose that there are two types of 
individuals, movers and stayers, as in equation (3.4). The movers have a high 
exit rate, whereas the stayers have a low rate. In the experiment I first specify 
a true underlying model, namely the rate 


(ln) = exp(Bo + òp), (3.4) 


which is a special case of (3.1), where jt is a dummy variable equal to 0 or 
1, indicating whether a person is a mover (+1) or a stayer (—0), and ô is its 
effect. I set Bo = — 2.302 and 8 = 1.609. The rates are then A(¢|p = 0) = 
exp(Bg = .10 and A(t| i, = 1) = exp(By + 8) = .50. This is a specification 
identical to that advocated by Heckman & Singer (1984b), except that the 
number of categories here is known, namely two. I use a straightforward 
BHHH (e.g. Petersen 1986a) rather than the EM algorithm proposed by 
Heckman & Singer (1984b) for estimating the parameters. 

I assume that 100 persons on each value on u enter the population each 
year—for simplicity, at the beginning of the year. In the data generating 
process there is no randomness in the distribution of p, so p is a fixed 
covariate. But this distribution is unknown to the researcher when p is not 
observed and must hence be estimated. With the assumed entry process, the 
distribution of u to be estimated ıs then binomial with P(p = 1) = q = .50. 

With these exit rates and the entry process, I used the program language 
GAUSS (Edlefsen & Jones 1988) to generate a data set (for further details, 
see Petersen & Koput 1991). I record the time of failure for each case within 
a cohort of entrants, in double precision, so there is no time-aggregation bias 
in the ensuing estimates (see e.g. Petersen 1991b; Petersen & Koput 1992). 
I let the process run in this way for a total of 21 years. Observations that 
failed during or after the 21st year of the process, as measured in calendar 
time (not individual duration), were treated as right-censored. Altogether, I 
generated data for 4200 observations. 

With these data I report estimates of four different versions of this rate 


Allu) = exp(Bo + Sp + yin 2). (3.5) 


In the first specification I control for jz by using its values in the data (0 or 
1). In the second specification I do the same, but control for duration (i.e. 
Int) as well. The true value of the duration effect y is zero. In the third 
specification there is no control for w (i.e. 6 = 0), but I control for duration 
(Inr). That specification corresponds to the rate with unobserved heterogeneity 
where no attempt is made to correct for it. In the fourth specification I exclude 
p. and control for duration (Int), but I try to correct for the unobserved 
heterogeneity induced by the exclusion of jp by using the random-effects 
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procedure. The objective is to see if I can recover the structural parameters 
Bo = —2.302, 8 = 1.608, and y = 0, as well as the distribution of p, q = 
.50, even when p is not observed. 

Table 1 gives the results. The first column gives the estimates of the true 
underlying model when p is observed. The structural parameters are quite 
well recovered. The second column gives the estimates of the true model with 
duration added. Again, the structural parameters are well recovered. In 
particular, the duration effect, whose true value is zero, is not significantly 
different from zero. The third column gives the estimates of the model 
excluding the variable ji. This is the model with an unobservable and no 
attempt to control for it. The duration dependence parameter is negative and 
significantly different from zero. Thus, the results confirm the well-known 
fact that unobserved heterogeneity tends to produce negative duration depen- 
dence, because as time passes, more and more low-death rate observations 


Table 1 Estimated death rates for data with and without unobserved heterogeneity using 
models with and without correction for unobserved heterogeneity (estimated standard errors in 
parentheses) Note The table is taken from Petersen (199]a, Table 3) Data were generated by 
the random numbers generator in GAUSS (see Edlefsen & Jones 1988, p. 424) programmed 
here with the probability integral transformation for going from a uniform to an exponential 
random vanable. The model used to generate the censored and noncensored failure times 1s the 
hazard rate in equation (3.4), using the parameters reported in column 5 below. See section 3 
for further details. The estimation routine is described in Petersen (19864). 


No unobserved Unobserved heterogeneity 
beterogeneity (u 18 unobserved) 
No age Age No control for Random-effects True 
effect effect unobservables procedure parameters 
Bo —2 304793  —2 315118 — 1 504491* —2 383728 —2 302585 
(.028050) (.009097) ( 018775) (.072344) 
ô 1 629827 1.640111 1.739276 1.609437 
(.036041) (.039564) (.089549) 
Y 009097 — 199622* -034272 000000 
( 014320) ( 012969) (.024927) 
q 508750 -500000 
( 029554) 
L“ ~-7,466 —7,466 — 8,356 —8,299 
N 4,200 4,200 4,200 4,200 


‘Significantly different from the true value of the coefficient at the five-percentage level (two-tailed 
tests) For the true values of the parameters see column 5 above. 
“This 1s the loghkelihood of the model 
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will be left in the sample. The duration dependence effect picks up this 
phenomenon when one does not control for the frailty levels of the observa- 
tions. The fourth column reports estimates of a model with an effect of duration 
but with no measured controls for jy. However, I attempt to control for the 
unobserved variable u by means of the random-effects procedure. Column 
four shows: (i) the structural parameters Bp and 8 are well recaptured; (ii) 
there is no spurious duration effect, that is, the parameter y is not significantly 
different from zero, and (iii) the parameter q for the distribution for p is well 
recaptured with an estimate of .508; its true value is .50. 

To conclude, the random-effects procedure yields good results in this case. 
However, note that the case studied was simple: I knew the correct 
specification of the hazard rate as well as the correct distribution of the 
unobservable w. In more complicated models with less a priori knowledge, 
it appears that these techniques for dealing with unobserved heterogeneity are 
less reliable (e.g. Hoem 1990). 


4. Sampling Plans 


In many samples of event histories, the sampling plan itself gives rise to 
problems in the analyses. Suppose one samples only the last two events on 
each individual in a target sample. In principle one should then construct an 
estimator that adjusts for the fact that only portions of the life histories are 
Observed (see Hoem 1985). If one does not adjust for the sampling plan, the 
problem that arises is that the life histories of individuals with long-lasting 
spells are oversampled relative to those with shorter spells. For example, in 
analyses of job mobility, sampling only the last two jobs, one would observe 
an entire ten years of the job histories for those with only two jobs during the 
last ten years, while a much shorter part of the job histories for those with 
three or more jobs. Thus, one assigns more weight to slow than fast movers, 
which in turn leads to an underestimation of the rate. Hoem (1985) discussed 
procedures for dealing with such problems. 

Another type of sampling plan obtains when one samples individuals who 
are in a certain state at calendar time tp, say, the state of being employed in 
a federal bureaucracy in January 1980. This is often called sampling from the 
Stock, as opposed to sampling from the flow, where entrants into a state are 
sampled. Even when one knows the dates at which the people in the sample 
as of vg entered that state, so there is no left censoring, this type of data 
creates problems. The standard estimators of the rate are no longer consistent. 
To obtain consistent estimates one must adjust for having sampled from the 
stock of those who survived in the state up until ry. The problem that arises 
in this case is that one oversamples those with long durations in a state, 
because those people are more likely to be found in the state at a given point 
in time. This in turn leads one to underestimate the rate. If in addition, one 
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does not know the time at which those in the stock as of tg entered the state, 
the problem of left censoring remains. In that case the problem is two-fold: 
first, the need to adjust for having sampled from the stock; second, the need 
to adjust for the left censoring of durations among those sampled from the 
stock. 

Stock sampling and procedures for dealing with left censoring are discussed 
in Lancaster (1979), Ridders (1984), Heckman & Singer (1984a), Allison 
(1985), Badyar & White (1988), and Hamerle (1991). Lancaster (1979) 
proposed a particularly appealing solution to the problem of stock sampling, 
a solution that yields consistent estimates, though not efficient, but that, unlike 
the maximum likelihood estimator, is computationally simple. The maximum 
likelihood estimator in the context of stock sampling is computationally 
cumbersome and is most likely quite sensitive to misspecification of several 
of the quantities that must be specified in the maximum likelihood framework 
but that are not needed in Lancaster’s estimator. The latter is based on a 
maximum likelihood idea but does not give a full maximum likelihood 
estimator. In terms of computation it requires a program that can handle 
time-dependent covariates. I recommend that this estimator be used in research 
(see Petersen 1993, Sect. 15). 


5. Time-Aggregation Bias 


The estimators used to estimate the parameters of the hazard rate typically 
assume that the available measures of time are continuous. The assumption that 
time is exactly or continuously measured is rarely met. Instead, researchers 
typically know that the amount of time spent in a state lies between, say, j-1 
and j months, but not the exact time within that window. Researchers then 
customarily set the duration in the state equal to j and treat this as the exact 
duration. Or, if censoring occurred between j — 1 and j, the censoring time is 
set equal to j. These practices give rise to some bias in the estimates. This bias 
is called time-aggregation bias—a problem that arises in virtually all analyses 
of event history data. Rarely is something done to deal with it. 

The most natura] solution to the problem of time aggregation is to develop 
estimators that take account of the grouping of the duration measurements. 
These estimators are derived from the assumption that the process evolves 
continuously in time, but they adjust for the fact that measurements of time 
are not continuous (see, e.g., Cox & Oakes 1984, pp. 56-60; Heijtan 1989, 
pp. 172-74). Specifically, for an observation that experienced a transition 
between j — 1 and j, the likelihood contribution becomes the probability of 
experiencing a transition between j — 1 and j. Each term in this probability 
can be derived from the hazard rate. The likelihood based on this probability 
yields consistent estimates of the parameters of the hazard rate. If, in contrast, 
the researcher assumes that the exact duration is j, the likelihood contribution 
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of the observation becomes the probability density of experiencing a transition 
at duration j. The likelihood based on this probability density yields 
inconsistent estimates of the parameters of the hazard rate. 

It would always be preferable to use the estimator that adjusts for the 
grouping of the duration measurements. Researchers almost never do so, in 
part because of convention, and in part for the following four reasons. First, 
it is often impractical to develop the relevant estimator. In many cases it 
would require writing special purpose estimation routines. Second, as 
discussed above, in the case with more than one state that can be entered 
when a transition occurs, the parameters of all the relevant rates must be 
estimated simultaneously when using the estimator that adjusts for the 
grouping of the duration measurements. Using the estimator that assumes 
exact measurements of durations allows one to estimate first the rate to state 
1, then the rate to state 2, and so on. Third, the estimator that adjusts for the 
grouping of the duration measurements requires the researcher to specify a 
model for the censoring process and to estimate its parameters along with the 
parameters of the failure time process. This is needed in order to deal with 
censored cases. Fourth, researchers may believe that the bias induced by time 
aggregation is negligible. 

For these four reasons, the estimator based on the assumption that durations 
are exactly measured is often used and seems, from a practical point of view, 
preferable in spite of its bias. Investigations of this estimator are therefore 
relevant. Petersen (1991b) considered a constant rate model with no covariates 
and no censoring, and he addressed two issues in connection with time-ag- 
gregation bias. First, Petersen (1991b) discussed the size of the bias when an 
estimator based on exact measurements of durations is applied to grouped 
measurements of durations. That discussion provided guidelines for when the 
problem is likely or unlikely to affect severely the conclusions reached. Two 
conclusions were reported. (i) For a given window (i.e. number of time units) 
within which the duration is known to lie, the higher the rate the higher the 
bias. (ii) For a given rate, the wider the window within which the durations 
are known to lie, the higher the bias. 

Second, Petersen (1991b) addressed the question of how one can minimize 
the time-aggregation bias when using an estimator based on the assumption 
of exact measurements of durations. It was shown, in the case of a constant 
rate, that the assigned duration ought to lie in the first half of the window 
within which the event happened. The higher the rate, the closer the assigned 
duration is to the beginning point of the window. The lower the rate, the 
closer the optimal duration ıs to the midpoint of the window within which the 
event happened. Petersen & Koput (1992) extended these results to a constant 
rate model with a single categorical covariate with several values and to the 
case of a fixed right-censoring scheme. 
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As a simple rule of thumb, one might therefore assign the duration that lies 
at the midpoint of the interval within which the event happened, a rule 
extensively used in estimation routines based on the life table (e.g. Cox & 
Oakes 1984, pp. 56-60; Breslow & Crowley 1974). 


6. Continuous- Versus Discrete-Time Models 


It is important to distinguish between processes that are continuous in time 
and those that occur discretely in time. In tke latter type of process, events 
can only happen at specific prespecified times. Examples are presidential 
elections (once every four years) and graduating from major universities (at 
the end of each semester or term). As discussed in the previous section, 
measures of most continuous-time processes occur only at discrete time 
intervals, or duration measures are rounded up to, say, the nearest integer. 
Nevertheless, the process itself is continuous in time. It just happens that our 
measures of it are not, a problem that may cause time-aggregation bias. If a 
process is continuous in time, it is clearly preferable to use continuous-time 
methods. If there is time aggregation in the duration measures, just adjust for 
this, as discussed in the last section. If the process is discrete in time, clearly 
one should use discrete-time methods. 

Because discrete-time methods as applied to continuous-time processes are 
nevertheless popular in sociology (see e.g. Allison 1982), a few comments 
are in order. Their central feature is that a logit, probit, or similar model is 
specified for the probability of an event in each measured time unit in the 
data, say, each month or each year. The main drawback of this procedure is 
that the coefficients estimated, say, from a logit or a probit model for each 
time interval, need not be entirely comparable to coefficients obtained from 
a continuous-time hazard-rate model. However, if the probability of an event 
in each time interval is small, then the coefficients obtained from the 
discrete-time specification for most models will be quite close to those 
obtained from the continuous-time specification. For example, the coefficients 
from a logit model will approximate the coefficients from a proportional 
hazards model (e.g. Weibull or Gompertz; quite well, as long as the 
probability of an event in each time interval is small. And only the constant 
term, which captures the rate for the baseline group, will depend strongly on 
the length of the time interval (see Arjas & Kangas 1988), whereas the other 
coefficients, which have the interpretations as relative risks, tend to be 
unaffected by changes in the time unit; this 1s also the case in a continuous-time 
proportional-hazards model. In discrete-time models, an often-used specifi- 
cation is the complimentary log-log function available in GLIM (see Baker & 
Nelder 1978). It provides consistent estimates of the proportional-hazards 
parameters (e.g. a piecewise constant rate), regardless of the interval length 
or (equivalently) the size of the rate (see e.g. Prentice & Gloeckler 1978, p. 
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58; Allison 1982, pp. 72-73; Arjas & Kangas 1988, p. 5). This property is 
not shared by the logistic model. If the objective is to obtain estimates of the 
parameters of an underlying continuous-time hazard-rate model, the compli- 
mentary log-log function has the advantage of doing so (see the discussion in 
Allison 1982, pp. 72-73). 


7. Estimation Procedures 


For models where the hazard rate depends on explanatory variables in a 
parametric manner three estimation procedures are common. Below I elaborate 
on each of these. There are also several nonparametric estimators of the hazard 
rate and the survivor function (see, e.g., Wu 1989), but these are mainly 
useful for descriptive and exploratory purposes and are less relevant in the 
kinds of multivariate situations with explanatory variables found in most 
sociological research. 

The first estimation principle is the standard ML procedure, which has been 
widely used in research. It allows one to estimate the effects of covariates as 
well as the effect of duration in a state on the rate (see e.g. Lancaster 1990). 

The second procedure is known as the partial likelihood (henceforth PL) 
principle. It can be used for so-called proportional-hazards models, models 
that have this form: 


A(t | x)= A(Hexp(Bx), (7.1) 


where h(t) gives the effect of duration in a state on the rate. The central feature 
of this specification is that A(f) enters multiplicatively with the covariates 
exp(Bx). Examples of the proportional-hazards model are the exponential, 
Weibull, Gompertz, and log-logistic models. Each of these models can be 
estimated by ML. 

Cox (1975) proposed an ingenious principle for estimating all proportional- 
hazards models. Using this principle one estimates the effects B of the 
covariates x but not the effect of duration A(t). In fact, one estimates B and 
at the same time lets A(t) be totally free. The only restriction on A(t) is that 
it enters multiplicatively with exp(Bx). 

In forming this likelihood the only information needed on the durations is 
their rank ordering, not their actual values. Some information is therefore lost 
in using this likelihood. Nevertheless, this estimator is asymptotically 
equivalent in distribution to the ML estimator (Efron 1977). Additionally, it 
does not suffer from the potential inconsistency of the ML estimator due to 
misspecification of A(t). 

The PL framework ts easily extended to the cases with repeated events, 
multiple states, and time-dependent covariates. The framework can also be 
extended to the case where two or more events happen at a given duration f+. 
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Developing the exact PL in that case is more cumbersome, but good 
approximations exist (see e.g., Kalbfleisch & Prentice 1980, ch. 3). 

The third procedure derives from a linear regression-type model. Instead 
of specifying the density of the completed duration f,, one specifies the 
expectation as well as the density of the logarithm of t}, Int}. For some models, 
for example, the exponential and the Weibull, the expectation has a linear 
form, yielding the linear regression model 


In 5, = B*x, + oe, (7.2) 


where 8* will be explained below and e, is a random error term and o a 
so-called scale parameter. This model is often referred to as the accelerated 
failure-time model. 

The distribution of e depends on which hazard-rate model one believes 
governs the process. For a Weibull model, e has an extreme-value distribution 
(see e.g. Kalbfleisch & Prentice 1980, p. 24). The parameters B* and o can 
be estimated by ML, which is straightforward once the distribution for € is 
chosen. Note that specifying the probability distribution for e is equivalent to 
specifying the probability distribution for Inr, as in a linear regression model. 
So, in the framework in (7.2) one specifies the density function for Inr, as 
opposed to for ft, as in the standard ML framework. 

Each parameter in the B* vector gives roughly the percentage change in 
the expected duration resulting from a unit increase in the corresponding 
explanatory variable in x. The parameter B* is related to the parameter B in 
the hazard-rate framework by B = —-*/a. In the Weibull model, o is related 
to the duration-dependence parameter y in equation (3.5) by y = l/o, while 
in the exponential model, o — 1 and the duration-dependence parameter is 
zero. Estimates of B* have the opposite sign of the estimates of the hazard-rate 
parameters B and may differ from it by a constant multiplicative factor 1/o, 
except in the case of the exponential rate where o = 1. 

Procedures relying on the specification in (7.2) are quite popular, mainly 
because they are available in some standard statistical software packages. 
They have at least two drawbacks. First, if one is primarily interested in the 
impact of variables on the rate, one has to translate the estimates of B* and 
g into estimates of B(— —B*/a). This requires also translating the estimated 
standard errors of Q* and o into those of B, which can be cumbersome. 
Second, the specification apparently does not allow for time-dependent 
covariates nor multiple states. Researchers occasionally use procedures based 
on (7.2) for dealing with time-dependent covariates (e.g., Hannan & Freeman 
1989, chap. 10). Doing so requires various short cuts, and the results obtained 
are at best approximations to the results from a correct likelihood specification. 
To my knowledge, the accuracy of these approximate results has not been 
investigated. 
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An approach often used in the statistical literature, the counting-process 
framework (Andersen & Borgan 1985), can also be used to derive the ML 
and PL estimators. A particularly powerful aspect of that approach is that 
deriving asymptotic results for the estimators becomes much easier, in 
particular for Cox’s PL estimator (see e.g. Gill 1984). 

In conclusion, the first two procedures, ML and PL, seem preferable in 
research. The choice between ML and PL is not easy. If one specifies a 
proportional-hazards model, for example, a Weibull model, and has no 
intrinsic interest in the duration-dependence parameters, then the PL procedure 
seems preferable. If one either has a direct interest in the duration-dependence 
parameters or if one specifies a nonproportional-hazards model, only the ML 
procedure is available. The third procedure dominates the first two only if 
one is solely interested in the percentage impact on the expected duration of 
changes in explanatory variables x, not in the hazard-rate parameters directly, 
and if one studies a process with no time-dependent covariates and no multiple 
states. 

In terms of software for estimating these models, several special purpose 
routines are available, some of which were discussed in Blossfeld et al (1989, 
ch. 6). Unfortunately, the situation is grimmer with respect to widely dispersed 
mainframe packages. BMDP (see Dixon 1990) and SAS (1991) provide 
procedures for PL estimation with and without time-dependent covariates, 
although the latter can be cumbersome to implement. LIMDEP (Greene 1991, 
ch. 50) can do ML estimation with and without time-dependent covariates for 
several models, based on the results in Petersen (1986a,b). To a large extent 
it utilizes the routine developed by Petersen (19862), which was documented 
in Blossfeld, et al (1989, ch. 6 and App. 2). Sometime in the future, BMDP 
will provide a full implementation of the procedure developed in Petersen 
(19862), allowing the researcher to estimate some 8-10 fully parametric 
hazard-rate models by the ML method. LIMDEP appears to be the only widely 
dispersed general-purpose program that does ML estimation of several 
hazard-rate models. LIMDEP and SAS can both estimate the accelerated 
failure-time model in (7.2). How to use several of these programs, as well 
as other programs such as GLIM (see Baker & Nelder 1978), is described in 
Blossfeld et al (1989, ch. 5—6). 


6. Local Hazard-Rate Models 


In Cox's proportional-hazards model, using the PL principle, one lets the 
effect of time spent in the state on the rate be quite free. In fact, the effect is 
totally free; it can vary from one time unit to another in an arbitrary way. In 
some substantive situations it might be of interest to let the effects of the 
measured covariates also be quite free, that is, to let them vary over time in 
rather arbitrary ways. Wu & Tuma (1990) developed a framework that allows 
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this, calling it a local hazards framework. The basic ideas are these. First, 
one chooses some parametric form for the effects of the covariates on the 
rate—for example, an exponential form. Second, one lets the effects of the 
covariates vary between time units in the data. The time units can be chosen 
a priori by the researcher or can be determined more empirically from the 
data. For the effect of duration itself, one can choose various parametric forms, 
such as an exponential, piecewise constant exponential, Gompertz, Weibull, 
etc. 

, Such a framework has the advantage of more correctly describing the data, 
and it will obviously give a better fit to the data than a more restricted model 
(see Wu & Tuma 1991). There are however at least two drawbacks. The first 
is that the number of parameters becomes larger than in a more restricted 
model, requiring one to have a large sample size. Second, even with a large 
sample size, it becomes harder to interpret and keep track of what goes on in 
the data the more parameters one uses to describe the data with. This, of 
course, is nothing but the issue of bounded rationality. We use models to 
summarize data in a manner that makes them more accessible to a reader than 
a mere listing of all the data points. The more nonparamatric a method 
becomes, the harder it also becomes for the reader and researcher succicintly 
to state what goes on. 


9. Continuous State Space 


If the state space is continuous, the framework of (2.1) must be modified 
correspondingly, as developed in Petersen (1988, 1990). Let Y be the random 
and now continuous variable, and let y denote a specific realization of Y. In 
specifying the destination-specific rate of transition, focus on the probability 
density of y being entered in a small time interval, given what has happened 
up to the start of the interval (see Petersen 1988, p. 144). The destination- 
specific rate of transition, A(t,yly,x), where y, is the state occupied im- 
mediately prior to t, is defined as 


A(t,y|y, x)= 
(9.1) 


lim P[t € T < t +At,y = Y < y +Ay | T = tyxl/AtAy. 
Arlo 
Ay 10 


The overall rate of transition follows, in a manner analogous to the discrete 
state space framework in (2.2), by integrating over all the destination-specific 
rates, namely 


At | ypx) = jo May, Jody, (9.2) 
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where D(y) denotes the domain of y. 

Define, in a manner analogous to (2.3), the density of the destination state, 
given a transition at duration ¢, given that state y, was occupied prior to 1, 
and given x, as 


BOJT —ty,x) = E Ply € Y < yc Mt S T € t t Aty,xy/Ay. 
A40 
Ay (0 (9.3) 


Since g(y[T = t,y,,x) is a density function for y, given that a change in y 
occurred at t, we find, analagously to (2.4) in the discrete case, that 


Joy SOIT 7133 71. (9.4) 


The destination-specific rate of transition can be decomposed into the 
overall rate of transition times the probability density of the destination state, 
analogously to (2.5) in the discrete state space framework, namely 


A(t,yly,) = Ay) X OT = ny). (9.5) 


For estimation one can either focus on the destination-specific rate directly 
(see Petersen 1990), as in (9.1), or on its decomposition into the overall rate 
of transition times the probability density of the destination state, given a 
transition (see Petersen 1988), as on the right-hand side of (9.5). In the latter 
case, a two-step procedure is available. One estimates first the overall rate of 
transition and second the density of the destination state, given a transition. 
An example is given below. 

The survivor function follows in complete analogy to (2.6) in the discrete 
state space case, as 


P(T = tyly,-x)=expl- f Í A A(s,yly, x)dyds], (9.6) 


which one may obtain either by specifying A(t,y | y,.x) directly as in (9.1) or 
by using the decompsition in (9.5). In the latter case, one first integrates 
gO | T = t,y,,x) over the domain of y, which yields 1. Thereafter, one integrates 
the overall rate of transition in (9.2) from 0 to t. 

I present an example of this framework, from Visher (1984), also reported 
in Petersen (1988, pp. 157-61). It uses the two-step procedure where one 
specifies the overall rate of transition and the density of the destination state 
given that a transition occurred, as in (9.5). The data were taken from the 
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Norwegian Life History Study for Men, directed by Natalie Rogoff Ramsgy 
at the Institute of Applied Social Research in Oslo and described in detail in 
Rogoff Ramsgy (1977, pp. 43-60). The Norwegian Central Bureau of 
Statistics collected and organized the data. A representative sample of 3470 
Norwegian men born in 1921, 1931, and 1941 were interviewed retrospec- 
tively on their life histories from age 14 up to the date of interview in 1971. 
Detailed month-by-month employment histories as well as histories on other 
life spheres were collected. 

The analysis focuses on the rate of upward shifts in socieconomic status 
and on the value of socioeconomic status after an upward shift occurred. An 
upward shift is defined as a job change that results in an increase in 
socioeconomic status over the highest level previously attained. Almost all 
changes in socioeconomic status in this data set are upward. Jobshifts leading 
to either no change or a downward change in socioeconomic status are treated 
as if no change occurred, since theories of intragenerational status attainment 
are primarily about gains in attainment and have little to say about downward 
and lateral changes in socioeconomic status (see Sørensen 1984, pp. 91-93, 
97). If a person holds more than one job before improving his attainment over 
the previous highest level, the duration before the upward shift is the sum of 
the durations in the jobs held since the previous highest level of attainment 
was reached. The measure of socioeconomic status (see Skrede 1971) runs 
from a low of 3 to a high of 52 and can for all practical purposes be considered 
continuous. 

The rate at which upward shifts occur depends on (i) the sector in which 
the person works (private or public), (ii) the highest level of socioeconomic 
status previously attained (1.e., y,), educational attainment (junior high school 
or less, or high school or more), (iii) occupational position (manager, 
professional, craftsman, or manual worker), (iv) labor force experience, and 
(v) duration since the last upward shift. Other than y, all variables are treated 
as time-dependent, including labor force experience. The latter is allowed to 
vary continuously with time since the last upward shift (as detailed in Petersen 
1986a, pp. 231—32). Visher (1984, p. 123) specifies the rate of upward shifts 
as (suppressing subscripts to individual observations) 


Milt y,x(t),L,] = exp[Bx(t) + py, + o(L, + f) + ye], (9.7) 


where B is a vector of parameters giving the effects of the covariates in x, 
which includes a constant 1, education, sector, and occupation, measured as 
of the job held immediately prior to duration ¢; L, is the employee's labor 
force experience (measured in months) at the date the last upward shift 
occurred; L,+t is the labor force experience at duration t after the last shift 
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occurred, with effect a; y, is the highest socioeconomic status previously 
reached and p its effect; and y is the effect of duration since the last shift. 

The specification for the new value of socioeconomic status, y,+;, given 
that an upward shift occurred, is 


yai = Ox, + By, + €, (9.8) 


where 6; is a vector of parameters giving the effects of the covariates in x, 
which includes a constant 1, education, sector and occupation; 6, gives the 
effect of the highest level of socioeconomic status previously attained, and e 
is a stochastic error term (see Visher 1984, p. 158). Sector and occupation 
are measured as of the job held immediately prior to the change in Y (if that 
job differs from the job held when status y, was entered). It is assumed that 
the parameters pertaining to the new value of Y, given a shift in direction d, 
differ for upward and downward shifts. Hence, we can correctly estimate (9.8) 
on the basis of upward shifts alone, with no correction for truncation, because 
there is no truncation problem, as discussed in Petersen (1988, Eq. [19]). 

In Visher’s specification, therefore, the instantaneous rate of an upward 
change in Y depends on its highest value previously reached, on the time since 
that value was obtained, and on the exogenous variables, as seen from (9.7). 
The density of the new value of Y, given that an upward shift occurred, 
depends on the highest value of Y prior to the change and on the exogenous 
variables, but not on time since y, was reached, as seen from (9.8). There is, 
however, nothing in the general model specification that prevents one from 
entering the time elapsed since status y, was achieved as a predictor in (9.8). 

Assuming that there is no autocorrelation in the e's and that the expectation 
of e, conditional on an upward shift and on the right-hand-side variables in 
(9.8), is zero, the parameters of (9.8) can be consistently estimated by linear 
least squares. No distribution needs to be imposed on the error term. If the 
latter is normal, least squares and ML coincide, and if not, least squares still 
yields consistent estimates, under the usual assumptions. The parameters of 
the hazard rate were estimated by ML (e.g. Tuma & Hannan 1984, chap. 5). 

Table 2 gives the estimates of equations (9.7) and (9.8) (taken from Visher 
1984, Table 5.2, col. 1 and Table D, panel B, col. 1). I will not comment 
on every number in the table. Instead, I focus on the conclusions from this 
analysis that one could not obtain solely from analyses of the rate of upward 
shifts or of the size of shifts. In the first column we see that managers have 
& lower rate of upward shifts than do the other occupational groups. On the 
average they wait longer before experiencing an upward shift (net of the other 
variables). From the analysis of upward shifts alone, as in Sørensen & Tuma 
(1981), one would conclude that managers are the most constrained in their 
opportunities for increasing rewards, a conclusion that seems plausible in light 
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of their already high rewards and the ceiling effects that may set in. In the 
second column, we see that managers on the average make the largest jumps, 
given that an upward shift occurred. From the analysis of the size of the gain 
alone, as in Sgrensen's (1974) difference equation model approach, one would 
conclude that managers are the least constrained in their opportunities to get 
ahead. Considering equations (9.7) and (9.8) simultaneously yields a more 
nuanced picture. The process of intragenerational mobility appears to differ 
between managers and the reference group, manual workers, in the following 
way. The former wait longer before they experience upward shifts, but once 
they shift, they also jump further. Managers climb in few, but long, steps, 
whereas manual workers climb in many, but correspondingly shorter, steps. 
The approach taken here to the study of continuous state space failure time 


Table 2 Estimates of the effects on the rate of upward shifts in socioeconomic status and 
of the density of the new socioeconomic status given that an upward shift occurred (standard 
errors in parentheses) Source. Visher (1984, Table 5 2 and Table D). For exact definitions 
of the sample and variables, see Visher (1984, chaps 5—6) The table was also published in 
Petersen (1988, Table 1, p 160). Note The data were taken from the Norwegian Life 
History Study for Men (see Rogoff Ramsgy 1977, pp 43-60, Visher 1981). 


Variables Equation (9.7)* Equation (9.8)* 
Constant —3 6420 ( 0416) 10.7800 ( 2052) 
Duration (in months), Y 0 0005*( 0004) 
Labor force expenence (in months), a —0 0050 (.0004) 
Socioeconomic status before shift —0.0780 ( 0033) 0 6353 (.0181) 
Education (1 = high school or more)? 0.6630 (.0399) 5 0980 ( 2272) 
Sector (1 = public, 0 = private) 0 0960*( 0520) —0.2470*( 2393) 
Occupation? 
Manager —0 7970 (.2564) 5.3560 (1 370) 
Professional —0 3608 ( 1342) —0.8122*( 7165) 
Craftsman —0.1081 (.0463) 1.1770 (.2630) 
Loglikelihood* —22727.4 
Number of Events‘ 6,523 3,730 


*Not significantly different from zero at the 05 level, two-tailed tests 
“These are ML estimates of the rate of upward shifts in socioeconomic status (from Visher 1984, 


Table 5.2, col 1) The estimation routine 1s described in Petersen (1986a) 


* These are estimates of 6, and 8, in the density for the new value of socioeconomic status, given that 
an upward shift occurred (from Visher 1984, Table D, panel B, col 1). The estimates were obtained by 
least squares, which coincide with the ML estimates when the error term 1n (9 8) 1s normally distributed 

* The reference category 1s educational attainment equal to junior high school, its equivalent (in years) 


or less 


‘The reference category 1s manual workers 


° Using a likelihood ratto test, we can reject the constant rate model, A(f) = A, against the model in 


(9 7) at any reasonable level of significance 


‘In column 1, the number of observed spells 1s 6,523, out of which 3,730 are noncensored In column 


2, the number of observed upward shifts 1s 3,730 
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processes, allows us to characterize the difference in the processes in this 
way. 


PART Uf: ANALYSIS OF PANEL DATA 


10. Panel Data 


In panel data y,,denotes the dependent variable for individual i in period t, x, 
denotes the explanatory variables, and e,, denotes the error term. The effect 
parameters pertaining to x, are p. 

I first consider single-equation models for continuous dependent variables. 
Then I consider some nonlinear models for binary outcomes. 

All models considered are static. I believe that the greatest advances in 
analyses of panel data have been made for static models, namely, the 
procedures for taking into account unobserved variables. Most dynamic 
models for panel data are artifical in the following sense. They are usually 
derived from an underlying diffusion process, that is, a process where the 
dependent variable changes all the time and changes in small amounts 
(Coleman 1968). However, most processes studied with these models are 
failure-time processes, such as changes in wages and socioeconomic status 
(e.g. Rosenfeld 1980). If the process is a failure-time process it should be 
studied by means of hazard-rate models, which may have to be adjusted to 
deal with the nature of panel data (see, e.g., Allison 1982; Petersen 1991b; 
Petersen & Koput 1992) but should definitely not be studied by means of 
models appropriate for diffusion processes. 

Two types of statistical models have been the focus for analyzing panel 
data, namely, fixed-effects (FE) and random-effects (RE) models. In both 
cases, an analysis of covariance methodology is used. 


11. Linear Static Models for Continuous Dependent Variables 


Consider the continuous dependent variable y,, and the regression equation 


yu = Bx, + Ey. (11.1) 


The least squares estimator of (11.1) is usually referred to as the total estimator 
Br. We shall also have use for the so-called average equation 


y, = Bx, + €,. (11.2) 
where one regresses the average value y, of y, across time on the average 


value x, of x, across time. The least squares estimator of (11.2) is usually 
referred to as the between estimator Bg 
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With repeated observations on each individual, one is in a position to 
strengthen the specification of the error term. The standard procedure is to 
include an additional person-specific error term o, that is time invariant. One 
appealing interpretation of o, is easily obtained by letting z; denote all 
unmeasured variables for individual i and letting 5 denore their effect 
parameters. Then, a, = 8z,. Now, a, captures the effects of all unmeasured 
time-invariant variables for individual i. A crucial assumption is that these 
unmeasured variables are time constant and that their effects do not vary over 
time. Define then the regression equation which includes œ, 


Yu = Bx, + a + €, (11.3) 


In this specification one has controlled for the unobserved variables z,. The 
question is, can we estimate such an equation? It turns out that there are two 
general procedures for doing so, referred to as the fixed- and random-effects 
procedures (e.g., Hsiao 1986, Ch. 3). 

In the fixed-effects procedure one treats a, as a fixed effect by including 
in the regression equation a dummy variable 7, for each individual, equal to 
1 if individual i and O otherwise. Then one estimates a separate effect œ, for 
each individual. This is referred to as the fixed-effects estimator or sometimes 
as the within estimator. The latter label is used because the estimates can also 
be obtained by, first, subtracting from the dependent variable y,, its average 
y, over time for individual i and, second, subtracting from each of the 
independent variables x, their averages x; over time for individual i. In 
computing the estimates, one must either set the constant term in x, to 0 or 
one must constrain the o,'s to sum to 0. The estimate of B becomes 


T N T 
By =È D Gs - x) Gu- x) ET, E On - 30 - 301. (11.4) 


As (11.4) shows, the estimator By uses the within individual-level deviation 
from its mean across time in the estimation and is hence referred to as the 
within estimator. 

This procedure has some wonderful things going for it but also some 
drawbacks. Its enormous strength is that one can control for all unmeasured 
variables z; and get consistent estimates of B. The unmeasured variables need 
not be independent of the measured, and the dependence between measured 
and unmeasured variables can be quite arbitrary. So, going from a cross-sec- 
tional to a panel set-up in the case of a linear regression model allows one to 
estimate the effects of measured variables and at the same time to control for 
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all unmeasured variables that are time invariant and whose effect parameters 
do not vary with time. 

The principal drawback of this procedure is that one can only estimate 
the effects of variables x, that vary over time. This can be seen from (11.4) 
above. Variables that are constant over time disappear because x, — x, equals 
zero for those variables. This means that one cannot estimate the effect of 
such variables as sex and race. Intuitively this makes sense, because the 
effects of all time-invariant variables in x, will be picked up by the effect 
of œ, where o, = 8z,. This drawback is major, because in most applications 
the researcher's interest rarely is in a subset of few variables that vary over 
time. A related but less major drawback is that individuals that experience 
no variation on some of the variables do not contribute to the analysis as 
far as these variables are concerned: information on those variables is not 
used for the individuals. This can sometimes reduce the sample size 
drastically relative to the equation in (11.1), as individuals with no 
across-time variation in all the variables do not contribute in the computation 
of the estimator. In summary, the fixed-effects procedure uses the data less 
efficiently, partly because observations that do not exhibit within-individual 
variation over time will not be used for estimating B, and partly because 
it uses up a large number of degrees of freedom in estimating the effects 
of the person-specific dummy variables. 

The second procedure is the so-called random-effects procedure. In this, 
at, is treated as a random variable. One then estimates B, the variance of a, 
(i.e., 02), and the variance of e, (i.e., 02). If no distribution is imposed on 
a, and ep, generalized least squares (GLS) estimation is used. The random- 
effects estimator Ber of B then becomes a matrix-weighted average of the 
within and between estimators, By and Bp respectively. 

There are some instructive special cases of the random-effects estimator. 
First, when the estimate of o2 equals zero, then the random-effects estimator 
Bre coincides with the total estimator Br. There are no person-specific effects, 
that is, no unmeasured variables, and the least squares estimator ensues. 
Second, when the ratio o2/0? grows large, then the random-effects estimator 
Bgg reduce to the within estimator (y. The intra-person variance dominates 
in estimating B. This makes sense. When the within-individual variation 
around the regression line for each individual, as measured by o2, given x, 
and a, is small compared to the between-individual variation in the regression 
line, as measured by o?, then the within estimator comes to dominate the 
estimate of 8. The most precise regression line is then the within-individual 
line. Third, when the number of periods T grows large, the random effects 
estimator converges toward the within- or fixed-effects estimator: The within 
variation dominates in estimating B. In my experience this convergence is 
quite rapid, so that T of 50 or so gives random effects estimators almost 
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identical to the fixed-effects estimator. Now, 50 periods is a very large number 
of periods 1n most panel data sets, but in other settings where these models 
can be used—all settings with data with a group structure such as samples of 
schools and the students within each school—50 or more observations within 
each group is quite common. 

The GLS estimators of o2 and o7 are instructive (e.g. Hsiao 1986, p. 38) 


N T 
oz = — —À 2 [Ou - y) - Bx, — xr, 
NT-)-Kc ^ (11.5) 
d is 2 d: 
eal ae WEE 0- Box) 1-7 o. 


(11.6) 


where K is the number of explanatory variables and N the number of 
individuals. We see here that o2 is computed as the average within-individual 
variance around the regression line, that is, as the variance around the within 
estimator. Further, o2 is computed as the between-individual variance around 
the regression line, that is, as the variance of the individual-level average 
around the between estimator, minus a correction based on o2. 

It is quite informative to discuss the numerical values of o2 and o2. The 
first tells something about the amount of variation between individuals in the 
impact of the unmeasured variables a,. If this is large, the unmeasured 
variables account for large differences between individuals, given x,. The 
second, of, tells something about the variation within an individual over time, 
given the measured x,, and the unmeasured o, variables. Comparing c? and 
o$, one can say something about whether the differences caused by 
unmeasured variables tend to be larger between individuals than within 
individuals over time, again, given xy. 

The random-effects procedure has drawbacks as well as advantages. Its 
main drawback is that one must assume that the unmeasured variables z,, or 
&, are independent of the measured variables. This is a strong assumption. 
Its violation leads to inconsistent estimates of p. 

Its main advantages, compared to the fixed-effects procedure, are first that 
one can estimate the effects of variables that are constant over time as well 
as of those that vary over time, and second that information on all individuals 
and all variables on each individual, even those that are constant over time, 
is used. 

The choice between the two estimators is not easy. When the number of 
periods T 1s large, the two more or less coincide. For small T the situation 
is quite different: the two can give diverging answers. The trade-off then 
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is that random-effect estimator uses the data more effectively, but one has 
to accept the assumption that the unmeasured variables a, are independent 
of the measured x,. Some tests are available for testing this assumption 
(see Hsiao 1986, pp. 48-49, see also Chamberlain 1982). This assumption 
is of course no different from the assumption always made in analyzing 
data from cross-sectional samples. Finally, if one wants to estimate the 
effects of time-invariant variables, the fixed-effect estimator is of no use. 
It seems that reporting both estimators may be the preferred solution. If the 
two differ significantly, there is cause for further speculation as well as 
investigation, for example, by testing explicitly for independence of a, and 


The framework discussed extends easily to the case where the number of 
periods T varies between individuals (Hsiao 1986, Ch. 8.2). Another 
extension, which has become quite popular in sociology, under the label of 
multilevel models, obtains when some parts of B are functions of exogenous 
variables (Mason et al 1983, Hsiao 1986, Ch. 6.5). 


12. Static Models for Binary Dependent Variables 
Consider the binary dependent variable D,, and the following logit equation: 


P(D, = 1x4) = exp(Bx,)/[1 + exp(Bx,)]. (12.1) 


As in the linear case, the panel data set-up may help one better to control 
for possibly unobserved variables that affect D,,. 

If there are unobserved variables in (12.1), maximum likelihood estimation 
yields inconsistent estimates of B, even when the unobservables are indepen- 
dent of the observables, unlike the linear case. Let, as in the previous section, 
z, denote all the unmeasured variables for individual i, let 8 denote their effect 
parameters, and let œ, = 5z,. Then 


P(D, = 1lx,,0)) = exp(Bx,  ojy[1 + exp(Bx,, + a)l. (12.2) 


In this specification one has controlled for the unobserved variables o, or z. 
As in the linear case, it turns out that there are two general procedures for 
estimating B, referred to as the fixed- and random-effects procedures. 

The fixed-effects procedure, unlike the case considered in the previous 
section, does not proceed by estimating the effects of dummy variables 
specified for each individual. Doing so would yield inconsistent estimates of 
B as long as T is finite, a result that generally holds when one goes from a 
linear to a nonlinear setting with individual-level specific effects (e.g. 
Chamberlain 1980). 
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Rather, from (12.2) one can compute a so-called sufficient statistic in which 
B but not o, occurs. This allows one to consistently estimate B (see 
Chamberlain 1980). The sufficient statistic is computed for each individual 
by conditioning on the sum of D, across t. The estimator is more complicated 
than in the linear case, but some software has implemented it, such as LIMDEP 
(see Greene 1991). The advantages and drawbacks of this estimator are the 
same as those of the fixed-effects estimator in the linear case: one need not 
assume that œ, is independent of x, but one cannot estimate the effects of 
variables that do not change over time, and the estimator uses the data less 
efficiently. 

In the random-effects procedure a, is treated as a random variable. One 
then estimates B and the distribution of a,. One may impose a specific 
distribution on a,, or the nonparametric estimator discussed in the section on 
Unobserved Heterogeneity for event histories can be used. The random-effects 
procedure has the same drawbacks and advantages as in the linear case. Its 
main drawback is that one must assume that the unmeasured variables z,, or 
a,, are independent of the measured variables. Its main advantages are first 
that one can estimate the effects of variables that are constant over time as 
well as of those that vary over time, and second that information is used on 
all individuals and all variables on each individual, even those that are constant 
over time. 

If instead of a logit one considers a probit model, a fixed-effects estimator 
is no longer available, only the random-effects estimator (Chamberlain 1980). 
The sufficient statistic required in order to compute the fixed-effects estimator 
is available only for a limited class of models. 


13. CONCLUDING REMARKS 


The analysis of linear static equations for continuous dependent variables using 
panel data has been thoroughly investigated. The results of these analyses are 
impressive. Moreover, these results apply in all situations where the data have 
a group structure: such as schools and students within schools and firms and 
workers within firms (see e.g. Petersen 1991c, 1992). The analysis of logit 
and probit models for panel data is also well established. One can still expect 
advances in analyzing nonlinear models for panel data, where the develop- 
ments probably will draw on analysis of covariance methodology, as has been 
well established for the linear case with continuous outcomes. 

Within event history analysis several topics are in need of research; two of 
them are procedures for adjusting for peculiarities of sampling plans and 
models for analyzing how two or more processes with mutual dependencies 
evolve over time. 


452 PETERSEN 


ACKNOWLEDGMENTS 


I thank an anonymous reviewer as well as John Hagan for constructive 
comments. For research assistance I thank Kenneth Koput. The research was 
supported by the US Institute of Aging, grant AG04367 and by the Institute 
of Industrial Relations at the University of California, Berkeley. The opinions 
expressed in this paper are those of the author. 


Literature Cited 


Allison, P. D. 1982. Discrete-time methods for 
the analysis of event histories. In Sociolog- 
ical Methodology 1982, ed. S. Leinhardt, 
pp. 61-98 San Francisco Jossey-Bass 

Allison, P. D. 1985. Survival analysis of 
backward recurrence times J Am. Stanst 
Assoc. 80 315-22 

Andersen, P. K Borgan, Ø. 1985 Counting 
process models for life history data. A review 
(with discussion) Scand J Statist. 12.97— 


158 

n E., bog anger, P. 1988. A discrete time 

e analysis of event histories 

SAGA Res. Rep. in Demography, No 
49 Stockholm: Univ Stockholm 

Badyar, N , White, M. 1988. A method for 
analyzing backward recurrence time data on 
residential mobility. In Sociological Meth- 
odology 1988, ed. C. C Clogg, pp. 105-35 
Washington, DC: Am. Sociol. Assoc 

Baker, R. I., Nelder, J. A. 1978. The GLIM 
System Oxford: Numerical Algonthms 


Grou 

Blossfeld, H.-P., Hamerle, A , Mayer, K U. 
1989 Event History Analysis Hillsdale, 
NK:Erlbaum 

Breslow, N., Crowley, J. 1974. A large sample 
study of the life table and product limit 
estimates under random censorship. Ann. 
Statis 2 437~53 

Chamberlain,G 1980. Analysis of Covariance 
with qualitative data. Rev. Econ. Stud. 47 
225-38 

Chamberlain, G. 1982. Multivariate regression 
t for panel data. J Econometrics 18. 


Chamberlain, G. 1985. Heterogeneity, omitted 
variable bias, and duration . In 
Longitudinal Analysis. of Labor Market 
Data, ed J J. Heckman, B Singer, pp. 
3-38 New York: Cambridge Univ. Press 

Coleman, J. S. 1968. The mathematical study 
of change. In Methodology in Social Re- 
search, ed H. M. Blalock, A. Blalock, pp. 
428-78. New York: McGraw-Hill 

Cox, D R 1975. Partial likelihood Bro- 
metrika 62:260—76 

Cox, D R., Oakes, D 1984 Survival Analysis 
London: Chapman & Hall 


DeGroot, M. H 1970. Optimal Stanstical 
Decisions New York: McGraw 

Dempster, A P, Lard, N., Rubin, D B 
1977. Maximum likelihood from incomplete 
data via the EM algorithm. J R Statist Soc. 
Ser B 39:1-38 

Dixon, W. J. ed. 1990. BDMP Statistical 
Software Berkeley, Calif Univ. Calif. Press 

Edlefsen, L E., Jones, S. D 1988. GAUSS 
Manual Kent, Wash: Aptech Systems 

Efron, B. 1977 The efficiency of Cox’s hke- 
lihood function for censored data J. Am 
Statist Assoc 72 557-65 

Gill, R. D 1984 Understanding Cox’s regres- 
sion model. a Martingale approach. J Am. 
Statist Assoc. 79. 441-47 

Greene, W. H. 1991 LIMDEP. New York 
Econometncs Software 

Hamerle, A 1991. On the treatment of mter- 
rupted spells and initial conditions in event 
ae analysis. Sociol. Methods Res 19: 
3 14 

Hannan, M T., Freeman, J 1989. Organiza- 
tional Ecology. Cambridge, Mass: Harvard 
Umv Press 

Hannan, M T, Tuma, N B. 1979 Methods 
for temporal analysis Annu, Rev. Sociol 
5:303—28 

Heckman, J. J., Singer, B 1984a. Economet- 
rc duration analysis. J. Econometrics 24: 
63-132 

Heckman, J J., Singer, B. 1984b. A method 
for minimizing the impact of distributional 
assumptions 1n econometric models for du- 
ration data Econometrica 52:271-320 

Heijtan, D. F. 1989. Inference from grouped 
continuous data a review. Statistical Sci- 
ence 4: 163-83 

Hoem, J M 1985. Weighting, misclassi- 
fication, and other issues in the analysis 
of survey samples of life histories. In 
Longitudinal Analysis of Labor Market 
Data, ed. J. J. Heckman, B. Smuger, 
p. 249-83. New York Cambridge Univ. 


Hoem, J. M 1990. Limitations of a heteroge- 
neity technique: selectivity issues in conjugal 
union disruption at parity zero 1n contempo- 
rary Sweden. In Convergent Issues in Genet- 


LONGITUDINAL METHODOLOGY 


ics and M ede d E J. Adams, O. A 
Lam, A. Hermalin, & P C. Smouse, pp. 
133-53 Oxford. Oxford | Univ. Press 

Honoré, B. 1990 Simple estimation of a dura- 
tion model with unobserved heterogeneity. 
Econometrica 58:453—73 

Hsiao, C 1986. Analysis of Panel Data New 
York: Cambndge Univ. Press 

Kalbfleisch, J. D., Prentice, R L. 1980 The 
Statistical Analysis of Failure Time Data. 
New York: Wiley 

Lancaster, T. 1979. Econometric methods for 
the duration of unemployment Economet- 
rica 47.939—56 

Lancaster, T. 1990. The Econometric Analysis 
of Transition Data. New York: Cambndge 
Univ. Press 

Mason, W M., Wong, G. Y , Entwisle, B 
1983 Contextual analysis through the multi- 
level linear model. In Sociological Method- 
ology 1983-1984, ed S. Leinhardt, pp 
72—103. San Francisco: Jossey-Bass 

Nickell, S 1979 Estimating the probability of 
leaving unemployment Econometrica 47 
1249-66 

Petersen, T. 1986a. Estimating fully parametric 
hazard rate models with time-dependent cov- 
ariates. Use of maximum likelihood. Sociol 
Methods Res. 14:219-46 

Petersen, T. 1986b. Fitting parametric survival 
models with time ndent covariates J 
R. Statist. Soc Ser C 35 281-88 

Petersen, T 1988. Analyzing change over time 
in a continuous dependent variable Specifi- 
cation and estimation of continuous state 
space hazard rate models. In Sociological 
Methodology 1988, ed. C C. Clogg, pp. 
137-64 ashington, DC: Am Sociol. 
Assoc. 

Petersen, T 1990 Analyzing continuous state 

failure time : Two further 

results. J Math. Sociol 10:247-56 

Petersen, T. 1991a The statistical analysis of 
event histories. Sociol. Methods Res. 19: 
270-323 

Petersen, T 1991b. Time-aggregation bias 1n 
continuous-time hazard-rate models. In So- 
ciological Methodology 199], ed by P V 
Marsden, vol. 21, pp 263-90 Cambridge, 
Mass. Basil Blackwell 

Petersen, T. 1991c. Reward systems and the 
distribution of wages J. Law, Econ, 
Organ. 7.130-58 

Petersen, T. 1992 Payment systems and the 
structure of inequality. conceptual issues and 
an analysis of salespersons in department 
stores. Am. J. Sociol 98:67—104 

Petersen, T 1993. Analysis of event histories 
In Handbook of Statistical Methods in the 
Social Sciences, ed. G. Arminger, C. C 
Clogg, & M. Sobel. New York: Plenum (in 


press 
Petersen, T, Koput, K. W. 1991 Density 


453 


dependence in organizational mortality pro- 
cesses: legitimacy or unobserved heteroge- 
neity? Am Sociol. Rev. 56°399-409 

Petersen, T , Koput, K. W 1992. Time-aggre- 
gation bias in hazard-rate models with cov- 
anates. Sociol. Methods Res 21: 25-51 

Prentice, R L., Gloeckler, A. 1978 Regres- 
sion analysis of grouped survival data with 
application to breast cancer data. Biometrics 
34 57-67 

Ridders, G. 1984. The distribution of single- 
spell duration data. In Studies in r 
Market Dynamics, ed. G R Neumann, N 
C Westergard-Nielsen, pp. 45-73. New 
York: Springer- Verlag 

Rogoff Ramsøy, N 1977 Sosial Mobilitet i 
Norge (Social Mobility in Norway). Oslo: 
Tiden Norsk Forlag 

Rosenfeld, R. A 1980. Race and sex differ- 
ences m career dynamics. Am Sociol Rev. 
45:583—609 

SAS/STAT User's Guide, Release 6 04 1991 
Cary, NC SAS Inst. 

Skrede, K. 1971 Sosioskonomisk Klassifiser- 
ing av Yrker 1 Norge, 1960 (Socioeconomic 
classification of Occupations in Norway, 
1960) Report 71—1. Oslo: Inst. Appl. Soc. 
Res 


Sørensen, A. B 1974 A model for occupa- 
tional careers Am. J. Sociol 80.44—57 

Sgrensen, A B. 1984 Interpreting tume depen- 
dency in career processes. In Stochastic 
Modelling of Social Processes, ed A. Diek- 
man, P Mitter, pp. 89-122. New York. 
Academic Press 

Sørensen, A. B , Tuma, N. B. 1981. Labor 
market structure and job mobility In Re- 
search in Social Srraficanon and Mobility, 
ed. D. Treuman, R V. Robinson, 1 69-81. 
Greenwich, Conn. JAI 

Spilerman, S 1972. Extensions of the mover 
stayer model Am. J Sociol 78:599-627 

Trussel, J., Richards, T. 1985. Correcting for 
unmeasured heterogeneity usmg the Heck- 
man-Singer procedure In Sociological 
Methodology 1985, ed. N B. Tuma, pp. 
242-76 San Francisco: Jossey-Bass 

Tuma, N. B., Hannan, M. T 1984. Social 
Dynamics Models and Methods. Orlando, 
Fla. Academic 

Visher, M G. 1981. Data documentation for 
the Norwegian Life History Data (translated 
from Norwegian by M. G. Visher) Madi- 
son. Univ. of Wisconsin, . of Sociology 

Visher, M. G 1984. The workers of the state 
and the state of state workers: a comparison 
of public and private employment in Norway 
PhD thesis Dep. Sociol., Univ Wisc., 
Madison 

von Eye, A. ed. 1990. New Statistical Meth- 
ods in Longitudinal Research Vols I, II. 
Orlando, Fla: Academic 

Wu, L L. 1989. Issues in smoothing empirical 


Annu Rev Sociol 1993. 19 455-67 
Copyright © 1993 by Annual Reviews Inc All nghts reserved 


RECENT MOVES IN THE 
SOCIOLOGY OF LITERATURE 


Wendy Griswold 
Department of Sociology, The University of Chicago, Chicago, Illinois 60637 


KEYWORDS: hterature, culture, reading, literary system authors 


Abstract 


Building on earlier work in the production of culture, reception aesthetics, 
and cultural capital, sociology of literature research during the past few years 
has concentrated on readers' construction of meaning and on networks within 
literary systems. New directions include studying the relationship between 
literature and group identities; connecting institutional and reader-response 
analyses; reintroducing the role of authorial intentionality; and developing a 
clearer understanding of how literature is and is not like other media. 


INTRODUCTION 


Several years ago when Priscilla (Clark) Ferguson, Philippe Desan, and I 
were discussing a special issue of Critical Inquiry that we were editing on 
The Sociology of Literature," I grumpily maintained that the sociology of 
literature was not really a field at all. Considering our mission, this was a 
rather awkward stance to be taking. What 1 should have said, precisely, was 
that the sociology of literature was more like a field of flowers than a field 
of battle. It had produced impressive theoretical assertions, brilliant but 
isolated insights, and rich veins of research findings, but it was not organized 
around key questions or debates the way a proper field ought to be. Now I 
find myself again, like someone trapped in virtual reality, compelled to 
describe the characteristics of this nonfield. To do so I shall switch metaphors 
from the geographical to the biological. The sociology of literature is like an 
amoeba: it lacks a firm structure, but has flowed along in certain directions 
nevertheless. In the following essay I try to describe its flow over the past 
decade, and suggest some future moves. 
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POINTS OF DEPARTURE 


Cultural studies made substantial advances during the 1970s and early 1980s. 
These advances included: the opening of Marxian theory mto a more 
dialectical understanding of the relations between base and superstructure 
(Williams 1980); a reconceptualization of popular culture that dropped the 
discredited mass-culture model in favor of one allowing for greater agency 
on the part of the users and manipulators of symbols (Hall et al 1980, 
Hebdige 1979); a sophisticated account of the uses of cultural capital to 
create or shore up economic capital (Bourdieu 1984, DiMaggio 1982); and 
a firm establishment of the point that cultural products are produced as the 
results of collective action (Becker 1982) and by organizational systems 
operating within markets and under various types of state controls (Hirsch 
1972, Peterson 1973). 

During the past decade a consolidation and extension of these gains has 
combined with Bourdieu’s insights regarding how wars of social status are 
fought on cultural fields and with cultural weapons. This combination has 
profoundly influenced the sociology of literature. For example, DiMaggio 
(1987) argues that the system of artistic classification found in a social system, 
in particular the genres used and understood by cultural consumers, should 
be understood as “ritual classification,” the outcome of formal social structural 
characteristics. Thus, a social system having high amounts of status diversity 
and complex role structures will tend to produce high degrees of generic 
differentiation; in such areas of great status diversity and complex role 
structures, however, the boundaries of ritual classifications among genres tend 
to be weak (students of postmodernism would point out that this characterizes 
contemporary culture in general). Looking at “cultural systems as totalities,” 
DiMaggio brings in production processes as well. Ritual classification 
processes respond to consumer demand—-consumers demand the conversa- 
tional cultural capital with which to pursue their social objectives. These 
processes are mediated by characteristics of production systems—commercial, 
professional, and administrative—through which art ıs produced and distrib- 
uted. Thinking along similar lines, Dubois & Durand (1989) suggest that, 
instead of looking for the connections between social classes and literature, 
it makes sense to think in terms of “textual classes”: genres classified both 
internally and among each other according to the “real, assumed, or 
constructed classifications of the different reading publics” (p. 142). Dueling 
genres replace struggling people on the battlefield, with the relationship 
between a textual class and (i) literary institutions, or (ii) general socioeco- 
nomic structure to be empirically investigated. 

Notice that both of these examples begin with the consumer of cultural 
products, specifically in the case of literature, the reader. Thus they represent 
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the connection recently being drawn between the cultural capital theories and 
reader-response criticism. 


THE READER AS HERO 


The most significant new direction taken by work in the sociology of literature 
in the past decade has been the reconceptualization of readers as creative 
agents rather than passive recipients of what authors write. Sociologists have 
embraced European “reception aesthetics” as a way to understand the 
construction of literary meaning. Proponents of reception aesthetics argued 
that the reader never comes to a text as a blank slate but instead places it 
against what Jauss (1982) termed a “horizon of expectations.” While Jauss’s 
conception of this horizon was primarily literary, social scientists have readily 
applied it to the different understandings and expectations that different groups 
or categories of readers bring to a single text. Authors will try to steer the 
process—every text has an “implied reader" (Iser 1974)}—but cannot control 
it. 

Reception aesthetics succeeded in transforming the research agenda in the 
sociology of literature, in part because it interacted with another theoretical 
development that was taking place among students of popular culture, one 
which accorded significant meaning-making ability to people themselves. 
Where mass culture theorists and hegemony arguments conceptualized the 
recipients of mass-produced cultural products as rather hapless, newer voices 
contended that people were more like bricoleurs, making meanings out of 
whatever was available to them. Moreover, these meanings often subverted 
the power relations presented in the content of the cultural object. This view 
was politically attractive because it respected previously despised genres— 
romance novels (Modleski 1982), television game shows (Fiske 1989), low 
riders and other symbols of working class assertiveness (Gottdiener 1985)— 
consumed by groups lacking social privilege and cultural capital; in these 
genres it found the sources of a certain wily resistance. 

Feminist studies of women readers and women's genres brought reception 
aesthetics and the new popular culture together most fruitfully. Women readers 
of formulaic romance novels, for example, whom academics formerly 
regarded as passive vessels into which mass culture poured its most mindless 
drivel, were reconfigured as agents, cultural actors making decisions and 
insisting on their rights, including the right to have time for their favorite 
form of relaxation (Radway 1984). The romances themselves were found to 
contain proto-feminist messages involving the revenge of the heroine on her 
male oppressors (Modleski 1982), the nurturing male (Radway 1984), and 
the successful and independent African woman (Griswold 1989). In a reflexive 
study that manages to make a virtue of the hermeneutic circle, De Vault (1990) 
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uses a more complex work of fiction to demonstrate how interpretation—spe- 
cifically, the interpretations of a Nadine Gordimer novel by initial reviewers, 
by scholars, and by DeVault herself—is collective and gendered. 

Not just gender, of course, but also class, occupational status, nationality, 
and life experience influence how readers read, and a number of sociologists 
have explored the nature of this influence. Howard & Allen (1990), seeking 
to delineate how men and women differed in their interpretation of two short 
stories, found that life experience overshadowed the modest gender differ- 
ences; readers who were older, married, and parents, for example, had more 
sympathetic reactions to complex characters than did young, single readers 
regardiess of sex. To the degree that gender does not make a difference in 
reader response, this may be due to the fact that little girls and boys continue 
to be introduced to reading via male-oriented children's books. Grauerholz & 
Pescosolido (1989) have shown how persistently sex stereotyping inculcates 
the youthful reader into a rather patriarchal horizon of literary expectations. 

Long (forthcoming), who has been studying middle-class reading groups 
in Houston, finds that these readers collectively construct a set of aesthetic 
criteria, which they then use for literary evaluation. These criteria are virtually 
untouched by trends in literary theory; for example, reading-group readers 
want "realistic" stories with "believable" characters with whom they can 
identify. In addition, they readily construct their own genres based on what 
they see in contemporary literature, one group's “women-in-pain” novels 
being a memorable example. "How quaint," scoffs the literary critic, but 
major gatekeepers in the business of literary production, for example 
Book-of-the-Month Club editors (Radway 1989), scramble to understand such 
processes of the local constructions of meaning and value. 

Because it involves inferences about meaning, setting reader-response 
criticism to sociological tasks runs the risk of a subjectivism not generally 
acceptable. While the interface between sociology and psychology is not made 
explicit in most such studies, sociologists have, in effect, been assuming a 
certain psychology of reading. According to this view, when we read 
literature, the text generates a series of mental associations, sometimes called 
"remindings," by which we relate the literary content to our own life 
experience (Dollerup 1991, Halász 1991). “For each individual, comprehen- 
sion is an associative and therefore memory-enriched process .. [it] depends 
on two factors: for how long attention is given to the sentence, and how 
effectively it cues the listener's own memory structures" (Nell 1988, p. 79). 
Such remindings may, like Rorschach blots, offer clues to individual 
psychology, but it is the pattern of readings—how certain groups are inclined 
to find certain meanings, and why—that interests sociologists (Griswold 1986, 
1987b, Larsen et al 1991). 

Schmidt (1982, 1991) has developed a theoretical model that is likely to 
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become increasingly influential as his work becomes better known to an 
English-speaking audience. Between a text and a reader Schmidt posits a 
"communicate," that point at which the decoding takes place. Decoding 
proceeds according to a set of conventions. The polyvalence convention allows 
the text to be decoded along several different lines, to have different meanings, 
either sequentially or simultaneously; in contrast, the monovalent convention 
looks for a single, correct meaning. Similarly, the aesthetic convention allows 
for ambiguity and does not demand truth as a criterion, as the fact convention 
does. This model offers greater precision in understanding reading response. 
For example, readers having identical sociological characteristics, but who 
apply different conventions to their reading, might come up with different 
communicates from the same texts (De Zepetnek 1992). 

Reading sacred texts demonstrates this clearly. Ammerman (1987) has 
studied conservative Christians whose socioeconomic profiles were identical 
to those of the middle class community they lived in, yet whose practices, 
which they maintained were based on the Bible, were very different. Such 
differences have traditionally been described as having a "literal" versus 
nonliteral approach to scripture; it might be more useful to think in terms of 
the application of aesthetic versus fact conventions. The questions would then 
be, do “Bible Believers” apply fact dnd monovalent conventions to other 
forms of literature as well? If not, what types of institutional triggers determine 
the conventions to be invoked? Schmidt's model allows for the convergence 
of a psychological approach, the microlevel of reading most compatible with 
reader response criticism, with a macroinstitutional level of analysis. 

The engagement of micropractices with macrostructures of social systems, 
such as institutions distributing power, is being shown to affect not just 
meaning making, but literary participation in general. Rogers (1991), applying 
a phenomenological approach, draws attention to the variety of ways the 
literature world (in which "expression and communication become ends in 
themselves," p. 8) interacts with the everyday worlds of both authors and 
readers. For example, she notes how both writers and readers have claimed 
that isolation (Woolf's A Room of One's Own) 1s a requisite for literary 
production and reception. Such isolation from the activities of daily life—the 
abandonment of the everyday world for the literary world—is more difficult 
for grown women to attain than for men (women are too busy with housework 
and child care, while men can legitimately demand isolation from the 
household); it is also more difficult for boys to attain than for girls (sex roles 
permit girls to be quiet and passive, while boys are encouraged to engage in 
more strenuous activities). Therefore, and despite the fact that youthful reading 
is virtually a prerequisite for a writer's development, Rogers’ research suggests 
why adult women are well trained as readers, but relatively under-represented 
as writers, while the reverse is true for men. 
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The second development of the past decade has been a surge in the study of 
how members of the literary system, at the point of production or consump- 
tion, organize themselves. In the 1970s the sociological study of culture was 
permanently changed by the establishment of the production-of-culture 
approach, which emphasized the organizational and marketing exigencies to 
which any cultural product is subject. Taking the “culture industry” not in the 
Frankfurt School sense of hegemonic producer of pablum for the masses but 
as a fact of cultural life in complex societies, sociologists like Peterson and 
Hirsch drew attention to the channels and conduits, the obstacles and filters, 
the middlemen and gatekeepers and boundary spanners, the wholesalers and 
retailers and media and critics, that connected creative artists with, or 
separated them from, their publics. While such an approach to culture was 
not altogether new (cf White & White 1965, Escarpit 1971), work in the 
seventies made it inescapable. Analyzing collective production of culture may 
have reached its apogee in Becker’s Art Worlds (1982), though Becker 
emphasizes interaction among human agents more than the purely industrial 
approaches did. 

As applied to literature, this production-of-culture approach has gone in 
four directions during the past decade. First, it has been incorporated into 
more traditional studies of the relationship between literary content and the 
social world from which it springs. For example, Long’s (1985) study of 
best-selling American novels over the 30 years following World War II shows 
that the best sellers’ attitude toward life organized around careers in large 
corporations shifted from one of enthusiastic affirmation to questioning, 
rejection, and finally nihilism. This might appear a straight reflection 
argument, but Long, who herself worked in publishing, adds a production- 
of-culture analysis to enrich her argument. She points out that in the early 
postwar era, publishing houses understood that the market for fiction was a 
general, middle class one. By the end of the 1950s and increasingly during 
the following decades, television supplanted reading as the mass entertainer; 
reading novels was more and more a pastime of the highly educated, the 
academics and professicnals, who maintained a more jaundiced view of 
corporate capitalism than the organization men and women themselves had. 
Publishers moved to satisfy this readership. So the change in bestsellers may 
have been as much in response to a shifting market as to a more general 
decline in belief in the American dream. 

A "strong program" of this combination of societal reflection and produc- 
tion-of-culture approaches coincides with recent work in cultural sociology 
which suggests that during times of social upheaval, ideological production 
by self-aware cultural innovators will increase (Swidler 1986, Wuthnow 
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1987). Kiser and Drass (Kiser 1985, Kiser & Drass 1987, Drass & Kiser 
1988) show that the publication of Utopian novels goes up during times of 
economic crisis in the world system. The outstanding example is the burst of 
utopian novels in the late nineteenth century, most notably Bellamy’s Looking 
Backward. Kiser (1985) relates this burst to the transition from free market 
to monopoly capitalism. Bellamy and other utopian writers did not just 
passively reflect economic changes; they were from the middle-class stratum, 
which had more cultural than economic capital and was most affected by these 
changes, and they were writing for a readership that was similarly worried 
about dispossession. 

The second type of production-of-culture extension has been toward 
examining publishing and other institutions in terms of their capacity to 
exclude or promote. Both feminist thinking and the deconstructionist attention 
to the artificiality of “the canon” have drawn attention to the selectivity of 
publishers and media gatekeepers. Tuchman (1989) studied how Victorian 
women writers were “edged out” of their dominant authorship position by at 
least one nineteenth-century publisher, once men had caught on to how 
profitable writing fiction could be. She offers an absorbing case study of 
invasion and exclusion (see also Rogers 1991, especially Ch. 9; for a telling 
example of promotion, in this case the promotion of Derrida by and among 
American literary critics, see Lamont 1987). Grauerholz & Pescosolido (1989) 
show how literary as well as social context influences what gets published, 
specifically the presentation of females in children’s books. In the early 
twentieth century, there was a vogue for “groups” at the center of children’s 
books [although the authors do not cite it, The Wizard of Oz and its many 
successors are a prime example]; collective protagonists generally involved 
both males and females. By the 1920s and 1930s, groups had receded and 
folk or hero stories, typically featuring a single male protagonist, were 
common. Thus, and contrary to the authors’ expectation, children’s books 
written during the early part of the century were actually more egalitarian in 
terms of gender representation than those of mid-century. Not that they were 
ever all that egalitarian. Over the century, the ratio of male to female central 
characters in children’s books has been 3:1. The recent trend toward a 
somewhat more equal presentation has been counterbalanced by the relentless 
masculinity of animal characters, which appears to be increasing; over the 
century, male animals have outnumbered female animals as central characters 
by an astonishing 6:1 ratio. 

Publishers aren’t the only gatekeepers, of course; government bodies and 
laws, either through the carrot of support or the stick of censorship, accomplish 
exclusion and channeling as well (Dubin 1992, Beisel 1990). While this effect 
is well known, it is not always straightforward. Berezin (1991), for example, 
asks a dog-that-didn’t-bark question: why did Italian Fascist theatre not 
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produce plays having explicit Fascist content? Although Mussolini stressed 
. the role of theatre as ideological weapon, in fact most plays performed during 
the 1922-1940 period were apolitical, love stories and other light entertain- 
ment. Berezin argues that states can control the content of cultural products 
and/or their production. If the regime controls both, it is totalitarianism; if 
neither, pluralism; if just content, cultural protectionism; if just production, 
state paternalism. State paternalism characterized cultural policy in Fascist 
Italy. The Fascists developed an organizational bureaucracy (which was huge 
because they had to employ a large number of baccalaureates—intellectual 
laborers) that marginalized writers and emphasized production in terms of 
sheer numbers while more or less ignoring content. There was little explicit 
censorship; the focus was on “doing theater" rather than “writing plays," and 
the more the better. Berezin contrasts this with the more explicitly totalitarian 
theatrical policies of Nazi Germany and Stalinist USSR and suggests that 
these latter had clearer ideologies to begin with—Fascist ideology was torn 
between bureaucratic rationality and romantic revolutionism—so they were 
better able to define and control the specifics of content. 

Third, Clark (1987) has pointed out that literary institutions, the systems 
of production and distribution associated with the production-of-culture 
school, should be distinguished from literary culture. A literary culture is “a 
constellation .. of mutually sustaining institutions, ideologies, symbols, and 
codes” (p. 8) that is enacted through a wide variety of social practices and 
that mediates between literature and society. A specific literary culture is the 
product of historical and geographical circumstances. In the case of France, 
a high degree of political and sociocultural centralization along with a lingering 
memory of aristocratic patronage that market-weary writers cherish has 
produced a literary culture distinguished by the public honor of writers; this 
literary culture shows up in everything from a proliferation of streets named 
after authors to advertisements suggesting an analogy between the satisfactions 
of winning a literary prize and of wearing a certain brand of underwear. In 
such a culture the “public writer" flourishes; such writers are intellectual 
heroes like Victor Hugo or Sartre whose political and literary personas are 
fused. Political leaders and the general public acknowledge the public writer’s 
role even if they neither read the writer’s books nor agree with his or her 
politics. Clark tells of how President de Gaulle, when rejecting a political 
petition of Sartre’s, nevertheless deferentially addressed him as “Cher Maitre.” 
China is another literary culture that assumes a “near synonymy of literature, 
morality, and politics” (Link 1983, p. 5; see also Link 1984, Link et al 1989), 
which is the reason why the government is at such pains to control its writers 
through various “winds” that indicate the currently correct lines. The relative 
impotence of the general public in China makes no “Cher Maitre” gestures 
necessary. In contrast, writers in many African states have a great deal of 
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freedom, not because they are supported by a strong literary culture but 
because of its very absence; as Nigerian novelists are wont to say, "the generals 
don't read" (Amuta 1986, Griswold 1992). 

Fourth, various forms of network analysis and clustering techniques have 
been used to map systems of literary production and reference. For some time, 
network analysis has been essential for understanding publishers as gatekeep- 
ers and readers as information processors. Coser et al (1982) emphasize the 
role networks play in the book business, and Powell (1985) follows up with 
an intensive study of academic publishing. Science fiction constitutes a 
different literary system from academic publishing (or does it?), but 
Bainbridge (1986) demonstrates that the world of the science fiction buff is 
similarly highly structured and dense with connections. 

Network thinking is mapping literary culture itself. Using data from a survey 
of writers in a (formerly West) German city, Anheier & Gerhards apply 
clustering techniques to illuminate some of the characteristics of the German 
literary culture. In one analysis (19912) they looked at writers who acknowl- 
edged being influenced by other writers versus those who denied such 
influence. What literary critic Harold Bloom has termed the "anxiety of 
influence" was expressed in denial, but the anxious writers tended to be the 
younger, less educated, and marginal to the literary system; more established 
writers readily acknowledged and located their work with respect to other 
writers. A second paper from this research uses block modelling to argue that 
myths about writers bear a close affinity to the social structure of the literary 
system (Anheier 1991b). At the center of the system is an elite in itself but 
not for itself, "not a group, but a set of individuals who tend to occupy unique 
structural positions" (p. 823; emphasis in the original). This “amorphous” 
elite is prominent and acknowledged by non-elite writers, who themselves 
constitute more coherent groups, but it is neither cohesive nor in structurally 
equivalent relations to the non-elite. This combination of an amorphous elite 
and peripheral groups corresponds to the myths of the (elite) writer as lonely 
genius and of the (non-elite) writer as poor poet or misunderstood genius. 

An application of network analysis into the realm of cognition, involving 
the literary frame of reference used by book reviewers in the mainstream 
press, is found in Rosengren's work on the "climate" of literature in Sweden 
(1983). Using a reviewer's "mention" of a writer other than the author of the 
work under review as his unit of analysis, and utilizing his earlier work on 
Sweden's literary system in the late nineteenth century as a point of 
comparison with postwar decades, Rosengren attempts to differentiate long- 
term, possibly invariant, patterns (for example, the tendency of reviewers to 
mention writers a little older than themselves and to draw from a very narrow, 
recent pool of names instead of utilizing the full pool of literary references 
available) from short-term trends, some of which are so short-term as to be 
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more properly seen as literary weather. His findings are remarkably intriguing. 
To give only a couple of examples, we learn that changes in the literary frame 
of reference were considerably less dramatic in the 1960s than in the 1880s; 
while the literary field was larger in the later period, and therefore the mentions 
more various, the rate of change was slower, and new schools—for example 
the French “new novel," or Marxist writers—-have not had as much impact 
as one might have supposed. Although Rosengren doesn’t specifically 
compare modernism to postmodernism, his data suggest that the latter may 
represent a less fundamental cultural change than the former. A second 
example: While the literary frame of reference has become more democratic 
since the 1960s, with popular genre writers mentioned alongside serious and 
even classical authors, it has not become more egalitarian in terms of sex. A 
subsystem seems to have emerged wherein women reviewers of books by 
women are more likely to mention other women authors, but men or women 
reviewers of male-authored books have not increased their mention of women 
authors, at least not through the mid-seventies. Indeed, Rosengren’s data 
suggest that literary ghettoization is taking place in a number of areas beyond 
women writers. He sees the literary system as breaking down into a tripartite 
system; instead of the old high/popular distinction, his data suggest a high 
literary level, a middle-brow level, and a popular level of genre fiction heavily 
influenced by other forms of popular culture, especially music. 


NEW MOVES 


A review of recent work in a field (or nonfield) inevitably tempts one into 
speculating about future directions and perhaps trying to steer toward them. 
Developments in the past several years, both within the sociology of literature 
and within the social world, suggest a four-item agenda. 

1. Recent ethnic and nationalist struggles prompt a new look at the 
relationship between literature and identity. Studies are needed of how 
macrostructures of language politics and regime transformation interact with 
microprocesses of reading and meaning construction, with institutions of 
education and literary production mediating. These studies will help us to 
understand what part literature plays in the ongoing process of identity 
construction and maintenance—a process for which people continue to be 
willing to die. 

2. Connections should be made between the institutional and the reader- 
response modes of analysis. The work of the Tilburg group in the Netherlands 
is moving in this direction. These researchers are firmly in the institutional 
camp, studying such things as literary magazines (De Nooy 1991), book 
marketing (Verdaasdonk 1992), and literary critics (Janssen 1988, Van Rees 
1989). But such institutions may be understood not simply as gatekeepers but 
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as influences on the conventions through which, by the construction of 
communicates, reading comprehension takes place. Van Rees (1989) suggests 
this when he criticizes the model of literary criticism that says that the critic 
describes, interprets, and evaluates a work; the description itself, the use of 
concepts like “character” or “balance,” is not objective but a convention 
legitimated by positions of cultural authority. Yet these descriptions help set 
the conventions though which other readers will approach the text and develop 
their own meanings. It is perhaps refreshing to know that a good many readers 
ignore what the cultural authorities have to say (Janssen & Leemans, 1988). 

3. Sociologists should rediscover that forgotten soul, the author, who has 
been deconstructed into oblivion. It may seem a sign of theoretical naivete, 
but it is a sign of common sense as well, to remind ourselves that human 
agents create the literary objects under consideration (Griswold 1987a). If a 
literary object is "shared meaning embodied in form,” then the creative agent 
may be someone other than the actual writer (for example, the guiding editor 
of a line of formulaic fiction—whoever sets the form). That said, there is no 
reason why authors, with their intentions, experiences, sociological charac- 
teristics, and "horizons" of understanding, cannot be treated in parallel fashion 
to readers: as agents who interact with texts, working to encode meanings 
(which may or may not be decoded by any particular group of readers—for 
all its sins, deconstruction has surely profited us by establishing the unreliable 
nature of texts). 

4. The relationship between printed literature and other cultural forms 
and media, especially in a context of cultural globalization, needs to be 
theorized and empirically examined. Practitioners of the sociological study of 
literature (and art) tend to confine their subject to an elite ghetto, perhaps 
because they fear it will be dissolved into just another form of communica- 
tions. This seems a misplaced concern. For example, both work on literary 
cultures and on understanding the application of conventions in reading 
suggest that reading is, at least to some degree, different from information 
processing or passive entertainment. We should move toward a better 
understanding of these differences. 
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Abstract 


This essay reviews the changes in the age structure of modern labor markets 
that have led to a dramatic decline in the employment of older workers. We 
document this trend and explore future trends, examine different explanations 
of the role of the welfare state, the labor market, and the life course. Each 
of these perspectives offers different views into the social meaning of early 
retirement. 


INTRODUCTION 


Ás can be noticed when analyzing the abundant sociological literature, 
retirement refers simultaneously to an event, a social institution, a stage in 
the life course, and the transition from work to nonwork. Let us examine 
these multiple meanings. 

For Atchley (1976), among others, retirement is an event that occurs when 
a person definitively stops working and withdraws from the formal labor 
market. But Streib (1988) has pointed out that retirement cannot be reduced 
to an "inactivity," as Ransom & Sutch (1988) implied when affirming that, 
as of 1870, many American men were retired because they were no longer 
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engaged in economic activity. Ransom & Sutch did note that these “retired” 
men had stopped working mainly for reasons of health or age-related 
handicaps. Retirement is quite different from the inability to work due to old 
age and the consequent dependence on family, private charity, or public 
welfare. 

As a social institution, retirement corresponds to a person’s definitive 
economic inactivity, with income replaced by a pension (Streib & Schneider 
1971, Guillemard 1980, Palmore et al 1985, Parnes and Less 1985). This 
institution developed along with industrial society, as has been shown for the 
United States (Atchley 1976, Graebner 1980), France (Guillemard 1986b), 
and Germany (Kohli 1987). Thanks to retirement, the transition from the 
working life toward old age is regulated through formal criteria (age or number 
of years spent working); it is no longer bound up with functional criteria about 
an individual’s “fitness” for work. At first privately run and then generalized 
through public programs to a whole age-group, old-age pension systems 
guarantee beneficiaries a substitute income. As the wage/pension replacement 
rate has risen, old-age has gained prospects different from poverty and 
disability: “As a social institution, retirement has been the major means of 
redividing the life span so that old age can be identified by a common 
dimension, i.e. as a period of inactivity on a pension" (Guillemard 1980: 22). 

Hence retirement has also come to refer to a stage in the life course. As 
pension systems developed, a norm emerged whereby the life course was 
divided into three major phases: youth as the time for education; adulthood, 
for work; and old age, for inactivity. Old age and retirement were combined 
in a "Third Age" identified with and by retirement. The pension system has 
created a new stage between adulthood and advanced old age, the latter 
characterized by frailty, handicaps, and disabilities. As Donahue et al (1960: 
331) have stated, “Retirement is the creation of an economically nonproduc- 
tive role in modern societies, which are capable of supporting large numbers 
of persons whose labor is not essential to the functioning of the economic 
order.” 

As a stage in the life course, retirement comprises a status and a set of 
social roles that define retirees’ collective identity and endow it with social 
meaning. These contents are constantly reworked as a function of the 
expansion of pension systems, changes in standards of living, and retirees’ 
collective behaviors. Initially designed as a “roleless role” (Burgess 1960), 
retirement has gradually assumed positive connotations as pensions have risen 
and benefits been extended to a whole age group. 

By the early 1970s, pensions in all highly industrialized countries were 
adequately replacing earned incomes for most retirees. As Myles (1988) has 
pointed out, the modern retirement era in the United States was inaugurated 
during Nixon’s presidency, when “enhancements in the retirement wage” 
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allowed retirees to maintain their standard of living. As a consequence, 
retirement has been redefined as a legitimate nonworking status that average 
income workers can afford to enter voluntarily. Owing to higher pensions, 
retirement has gradually been seen, and defined, as a time of leisure rather 
than a period of “deficits,” as the time of entitlement to rest after a life of 
work. Donahue et al (1960) had foreseen this: “Retirement comes to be viewed 
as a period of potential enjoyment and creative experience which accrues as 
a social reward for a lifetime of labor.” 

New words have emerged to embody the changing meanings of the social 
construction of old age through retirement. In France, the phrase troisième 
age ("third age") was coined in the mid-1960s with the intention of effacing 
old age’s image as a time of lack, disability, and degradation (Aries 1973). 
Based upon retirement secured by a pension, old age has become an active, 
autonomous period of full citizenship and social involvement. In the United 
States, the phrase “senior citizens,” coined at about the same time, bears a 
similar meaning. 

A final meaning of retirement is the process of passing from adulthood to 
old age, the transition from work to nonwork. Research in the sociology of 
retirement initially focused on “going on retirement.” These mostly American 
or British studies asked how individuals adjusted to this change of status and 
roles. The aim was to discover the factors of successful adjustment (Pollack 
1948, Havighurst 1957, Townsend 1957, Cumming & Henry 1961). For a 
long time, the sociology of retirement concentrated on the question of 
"voluntarism," as the individual moved toward retirement, and on the 
financial, social, and psychological consequences of leaving the labor market 
(Hardy 1992). 

By focusing on the factors underlying individuals' adjustments to retirement 
and levels of satisfaction, sociologists tended to overlook changes in the 
transitions to retirement and the reasons for them. (See Guillemard 1982 for 
a critical review of this literature.) For this reason, we can hardly restrict this 
research subject to sociology. Economics, too, has contributed to our 
understanding, even though sociology has often advanced by criticizing the 
utilitarianism of such analyses. Economists have broadened the scope of the 
sociology of retirement beyond the psychosocial aspects of new retirees’ 
adjustments and their satisfactions or dissatisfactions. 

Herein, the concept of retirement will be taken to be primarily a process 
or transition. Patterns of transition toward definitive inactivity have changed 
significantly in developed countries. Models of retirees’ behaviors have not 
undergone change insofar as they receive old-age pensions from the system 
of retirement institutions; but current retirees’ experiences before admission 
into this system are quite different from those of preceding cohorts. 

In developed industrial societies, the major contemporary change in 


472 GUILLEMARD & REIN 


retirement patterns has been the early definitive withdrawal from the labor 
market of masses of older wage-earners before the age of admission into 
public old-age pension systems. This early exit phenomenon has recently 
spurred reforms, or calls for them, that postpone admission into these systems. 
The timing of retirement, as well as the ways of leaving the world of work 
and entering a period of economic inactivity on a pension, have changed 
significantly. All this has, in turn, affected retirement's many dimensions. As 
a consequence, the social meaning of retirement 1$ coming undone. 

These observations make it necessary to use the term retirement more 
precisely. Herein, retirement shall apply only when people are admitted into 
a public old-age pension scheme. In general, this admission is based on age. 
When, however, people have definitively stopped working but have not yet 
been admitted into the public old-age pension system, we shall here refer to 
"early exit" or "early withdrawal from the labor market." This should help to 
avoid conceptual confusion. 

In the first part of this article, international statistics are compared to shed 
light on recent changes in the transition toward retirement, especially those 
related to the early, definitive exit of increasing masses of aging wage-earners. 
Attention then shifts from current empirical findings to a review of three major 
lines of theoretical inquiry in national or cross-national studies about the 
development of this worldwide early exit trend during the past 10-15 years. 
First of all, attempts have been made to relate this massive trend to changes 
in welfare systems, especially in pension systems. Secondly, some scholars 
consider that the major factors determining this trend lie in the labor market, 
in rising unemployment, and the reorganization of production. A third line 
of inquiry has worked out an explanation based on the changing life course 
as related to developments in social protection, an explanation that may or 
may not go so far as to analyze the relations between social policies and social 
models of the life course. In the final part of this article, we discuss the 
implications of this trend for the future. 


THE INTERNATIONAL EARLY EXIT TREND: FACTS, 
FIGURES AND QUESTIONS 


A fundamental change has occurred in nearly all highly industrialized Western 
countries: the transition toward retirement 1s now taking place earlier in the 
life span—well before 65, the age of entitlement to full old-age benefits in 
most of these countries. During the past 15 years, the labor force participation 
rate of men from 55 to 65 years old has dropped significantly. Of course, we 
must refine our analysis by exploring two questions. First, how can we 
distinguish the early exit trend from the century-long trend that has lowered 
the labor force participation rates of persons who are over the normal 
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retirement age? Second, has the early exit trend affected women as much as 
men? 


The Rising Early Exit Trend 


The drop in the labor force participation rates of 55-64 year-old men during 
the past 15—20 years has been massive and widespread (Figure 1). However 
labor force participation rates overestimate this age-group’s economic activity, 
because persons enrolled in unemployment are counted as being in the labor 
force even if they have but slight chance of finding jobs again. In fact, receipt 
of unemployment insurance can be (and is, in many countries) a new way to 
leave the labor force definitively, especially whenever unemployment pro- 
grams provide special allocations to aging persons who have lost their jobs. 
In France, for instance, a person dismissed at the age of 55 receives the 
maximal rate of benefits with prolonged coverage under the Unemployment 
Fund till 60, when he would receive full retirement benefits. In Germany too, 
unemployed wage-earners at age 52, were entitled to early retirement at 60. 
They were considered to be waiting on retirement and to have definitely 
stopped working. New legislation will gradually raise the age of eligibility 
for a public pension (Jacobs et al 1991). The ILO, Eurostat, and OECD define 
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Figure 1 International time series of employment activity rates of men from 55 to 64 years old 
(Source. OED Labor Force Statistics) 
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the labor force as the sum of employed and unemployed persons. Labor force 
participation rates represent the labor force as a percentage of the population 
of a certain age-group. Employment activity rates represent only persons in 
employment, i.e. persons who, during a reference week, did any work for 
pay or profit or who were not working but had jobs from which they were 
temporarily absent (The number of hours worked during the week varies 
depending on the statistics). Because employment activity rates take into 
account only persons in employment, the unemployed are excluded. This 
provides a more relevant measure of activity in the later years of life. 

The employment activity rates in Figure 1 highlight annual decreases in 
the proportions of 55—64 year-old men still working from 1970 to 1990. By 
1990, the employment activity rate had fallen below 50% in France and the 
Netherlands; and it was close to 5096 in Germany. Elsewhere, from one half 
to two thirds of the men in this age group had jobs in 1990, as compared with 
more than three quarters in 1970. This is true except for Japan and Sweden, 
where men's employment activity rates have decreased less than 1096 since 
1970. In contrast, these rates have decreased about 40% in France and the 
Netherlands, and about 3096 in the United Kingdom and Germany during this 
same period. In the United States and Canada, the decrease has not been so 
large, about 20%. 

Several conclusions can be drawn from Table 1, which refines the analysis 
by age-group. First, the employment activity rates of 60—64-year-old men 
plummeted and are now very low. Only Sweden and the United States have 
experienced a lesser decrease. Secondly, the employment activity rates of 
55-59 year-old men have not decreased in like proportions; the variation in 
the countries where this trend has been strongest is below 25% 1n France, the 
United Kingdom, Netherlands, and Germany. In Sweden and the United 
States, this age group’s activity rate has not varied much. This downward 
trend massively affected the 60—64 age group before it started affecting the 
55—59 one. In recent years, the downward trend of 55—59 year-olds' 
employment activity rates seems to have steadied, at least in the United States. 
It is premature, however, to generalize Quinn's (1991) conclusion that in the 
United States the early exit trend is ending or even being reversed. This 
problem has apparently not occurred for most EEC countries. 

In summary, we can say that employment activity rates after the age of 55 
have dropped significantly since 1970. For three countries—France, the 
Netherlands, and Germany——inactivity after 55 has even become a new norm. 
Today, between one half and three quarters of 60-64 year olds are no longer 
in the labor market. After sweeping up this age group, the early exit trend 
started reaching down among 55-59 year olds. In four major European 
nations, one out of three persons in the latter group are no longer in the labor 
market. 
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Table 1 Changes in male employment activity rates (%) by age group (Source. OECD Labor 
Force Statistics and our own calculations ) 


% point 9b 
1975 1980 1985 1987 1988 1989 change change 

55—59 year-olds 
West Germany 82.7 783 707 709 696 69.1 -13.6 -16 
France 81.2 774 626 61. 615 625 -18 7 -23 
Netherlands 768 72.8 603 62.4* 63.0 £616 -15 2 —20 
Sweden 88 9 86.8 856 847 84.9 863 -26 -3 
United Kingdom 897 84.7 678 65.5 677 68.0 -21 7 —24 
United States 79.8 78.2 75.5 761 759 760 -3.8 -5 
60-64 year-olds 
West Germany 552 41.4 317 324 319 31.8 -23 4 -42 
France 55 1 499 294 248 24 1 23 1 —32 0 —58 
Netherlands 623 463 267 267* 256 £235 —38.8 -62 
Sweden 72.3 674 617 £62.4* 627 61.6 -10 7 -13 
United Kingdom 746 | 606 49.6 50.0 513 511 -23 5 -32 
United States 61.6 = 577 527 524 519 - 524 -9.2 -15 


"Break in time series because data are calculated on a new basis 


Massive Early Exit: Part of the Long-Term Trend toward 
Earlier Retirement? 


As shown in Kohli et al (1991: 36), the sudden, massive early exit trend can 
be distinguished from a trend that gradually lowered the age of “normal” 
withdrawal from the labor force as pension systems were created and expanded 
from the late nineteenth century to the 1970s. Owing to this long-term trend, 
a tripartite model of the life course was widely adopted by the late 1960s. In 
this model, retirement at the third stage became a reality for nearly all people 
at the normal age of entitlement to a full public old-age pension (usually 65). 
Early in this century, the labor force participation rate of men over 65 was 
about 70%. By 1970, it had declined sharply to barely 20% in the United 
States, United Kingdom, France, and Germany. No more than 10% of 65—69 
year olds are still working in most countries, except Denmark (where the age 
of entitlement to a full pension is still 67) and a few southern European 
countries (notably Portugal and Greece). 


Women's Activity Rates after the Age of 55 


This empirical analysis has been based on statistics about men. Clearly, it is 
important to study changes in women's exit patterns at a time when more and 
more women have been entering the labor market. Unfortunately, we cannot 
enter this analysis given the limits of this article. 
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The case of women calls for a differentiated treatment, both at the level of 
empirical data and of research questions. It refers to a different, less abundant 
sociological literature. One question to be addressed is why women’s careers 
are more diverse than men’s. While far more women than men have 
discontinuous careers, increasingly female labor force experience is becoming 
more and more like that of men. This is especially true in Sweden. 
Consequently, the way women experience the transition toward definitive 
inactivity probably has a different meaning. Other questions arise because 
women’s labor force participation rates vary widely: about one third of women 
work in the Netherlands, compared with more than 80% in Sweden! These 
variations make it hard to conduct a comparative inquiry including both 
genders. 

The early exit trend has, as Kohli et al (1991:65) have shown, affected 
women as well as men; but the impact on women cannot be detected with 
cross-sectional statistics. To do this, a longitudinal analysis by cohort is 
necessary because two contrary trends have been affecting women. In 
cross-sectional analyses, the early exit of women over 55 is obscured by 
women’s massive entry into the labor market. This second trend has mainly, 
but not exclusively, involved younger women. For women, cross-sectional 
analyses by age group tend to indicate that, during the past 15 years, the labor 
force participation rate of women born before World War II has remained 
stable, or slightly increased. This leads, of course, to mistaken interpretations. 
In contrast, a longitudinal analysis shows a similar trend toward early exit for 
both women and men. 


Issues for Research 


During the past 15 years, the employment activity rates of persons over 55 
have decreased in nearly all Western countries, with the notable exception of 
Sweden. What general factors account for this? Is this trend merely a trend 
to earlier retirement? Or are we observing a basic change in the transition 
from work toward economic inactivity and, finally, retirement? In other 
words, does this change amount to more than a modified timing of retirement? 
Does it signal a new distribution of work and nonwork over the life span? 
Do these changes in the transition toward retirement broaden the choices 
ageing individuals face as, thanks to early exit measures and welfare programs, 
they decide whether to rest earlier than previous generations? Or is the choice 
already made for them, owing to economic circumstances and worsening 
unemployment? Has there been a decision to transfer jobs from older to 
younger wage earners? Or is early exit linked to changes in the way work is 
being reorganized and made more flexible? Are they part of a "flexible life 
scheduling breaking the education-work-retirement lockstep" (Best 1980)? If 
$0, activity and inactivity are being redistributed over the whole life span on 
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criteria that are not so tightly tied to age. In this case, older people could be 
maintaining economic and social activities beyond gainful employment. The 
notion of activity must then be broadened to cover socially useful, but unpaid, 
voluntary activities. Some researchers have detected such a trend, which tends 
toward the emergence of new, socially useful roles in groups already 
categorized as aged (Riley & Riley 1989). 


CHANGES IN SOCIAL PROTECTION AND EARLY EXIT 


Most economic and sociological studies have taken the system of social 
protection, whether private or public, its evolution and the politics underlying 
it to be the determinants of the early exit trend. But this body of literature is 
far from homogeneous. To clarify things, we shall distinguish between two 
theoretical approaches. The first focuses on the individual’s decision defini- 
tively to leave the labor force. Grounded in rational choice theory, it tries to 
show how this decision is shaped by the whole system of public and private 
social protection, its rules and incentives for work or retirement. This system 
is, therefore, a “pull” or “push” factor stimulating older workers, or certain 
categories among them, to choose to stop working because benefits outweigh 
costs for the individual or because their health is failing. In contrast to this 
individualistic pull/push factor interpretation, the second approach focuses on 
the social dynamics that have generated welfare systems. This approach gives 
emphasis to a structural, or what can be called a “political economy,” 
perspective. 


Social Protection as a Pull/Push Factor in the Individual’ s 
Choice 


During the past 15 years, sociologists and economists have widely adopted 
this mainly American approach, which seeks to shed light on “push” and 
“pull” factors in the individual’s choice between retiring and staying in the 
labor market. The pull comes from the coverage and monetary rewards 
provided by welfare systems; the push, from the individual’s state of health 
and his chances of staying in the labor market, which are tied both to the 
unemployment rate and to labor supply and demand in each country. Of 
course, macrosocial policies, which also provide incentives and disincentives 
for individuals’ decisions, blur the line between push and pull factors. Though 
theoretically clear, the distinction between supply and demand is difficult to 
discern empirically; but these difficulties do not outrule the usefulness of both 
concepts. Studies of push factors in the labor market are reviewed later on. 
As for pull factors, improvements in coverage under pension systems and 
the broadened eligibility for pensions before the normal retirement age (in 
many countries since the mid-1970s), have changed the conditions impinging 
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on the individual’s choice between retirement and work in the latter part of 
a career. Individuals are faced with new opportunities and constraints. The 
decision to withdraw early “costs less,” whereas the decision to go on working 
has become less attractive. Since Barfield & Morgan (1969) pioneered this 
approach, other studies have been made: Danzinger et al (1981), Wise (1985), 
Kotlikoff & Wise (1987), Tuma & Sandfur (1988), Packard & Reno (1988), 
Morrison (1988), Quinn et al (1990), Quinn & Burkhauser (1990) and Ippolito 
(1990). For an overview, see Aaron & Burtless (1984). 

Economists have mainly emphasized the implicit effect of monetary 
incentives in public and private pension systems. By working out microeco- 
nomic models, they have tried empirically to prove that these incentives 
amount to “work disincentives” in the later years of life. Under these models, 
individuals supposedly behave like a pure homo economicus. According to 
Quinn & Burkhauser (1990:313—14), 


An employee eligible for benefits who is considering working another year is 
really choosing between two retirement income streams, one starting today and 
another beginning in a year but with higher annual benefits If the asset value of 
the second stream is larger, the employee receives a pay check plus the increase 
in pension wealth for working that year. Total compensation exceeds traditionally 
defined earnings. But if the second (delayed) stream 1s smaller in present value, 
true compensation is less than the pay check by the amount of the wealth loss 
Recent research suggests that the latter is the case for many older workers. They 
suffer pay cuts 1f they stay on the Job too long, not directly by the paycheck but 
more subtly, through the details of their retirement income plans. 


To work out this interpretation, these two scholars have assessed the 
impact of incentives and disincentives on pension systems over the entire 
life span and not just at the time when the decision to retire is made. For 
them, social security and private pension systems serve as incentives for 
working at certain ages (notably during youth) but penalize this choice at 
the end of careers. 

The sociologists who have adopted such an interpretation share the 
economists' vision of pension systems pulling people out of the labor market 
before the normal retirement age. In most of their studies, however, other 
factors come into play. These sociologists have shown how social stratification 
differentiates the model of transitions toward retirement. Each group of 
wage-earners—owing to its socioeconomic position, past work experiences, 
and state of health, and depending on whether it works in a declining or 
growing industrial sector—faces a different set of interrelated opportunities 
and constraints when its members decide to interrupt careers. Economists' 
pull factor interpretations have been reworked so as to bring structural 
variables into the analysis (Hardy 1982, 1985, Fillenbaum et al 1985, Hayward 
et al 1989, Gohmann 1990, Hayward & Grady 1990). These sociologists have 


COMPARATIVE RETIREMENT PATTERNS 479 


shown that financial incentives for early exit do not affect all social groups 
alike. 

Other sociologists, who have also focused on the individual’s decision to 
stop working, have attributed this decision to cultural factors rather than 
economic calculations. Inkeles & Usui (1988), who think economic explana- 
tions are too narrow, give priority to purely sociological factors such as future 
retirees’ values, aspirations, and attitudes with regard to work and retirement. 
What retirement means to individuals and how much it attracts them, depends 
on how strong the work ethic is and on whether people think that the welfare 
state will help them maintain their incomes during the retirement years or that 
individuals should prepare for retirement themselves ahead of time. For more 
than 60% of people surveyed in the United States and Japan, old-age security 
mainly depends on the individual and his savings during his working life. In 
contrast, most people surveyed in France and the United Kingdom think 
welfare programs should provide this security (Inkeles & Usui 1988). 

The individualistic pull effect interpretation provides policy-makers with a 
scientific rationale for restricting early exit. Given the alarming prospects of 
demographic aging and the welfare state’s financial crisis, they advocate 
prolonging the working life so as to maintain a balance between the 
beneficiaries of and contributors to old-age funds. Research based on this 
interpretation has established that, in the United States, a direct cause-effect 
relationship exists between the early exit trend and the financial incentives 
embedded in retirement systems. The amendments to the 1983 Social Security 
act, which are just taking effect, would delay eligibility for a full old-age 
pension and abolish the “work disincentives” contained in the earnings test. 
Other countries, like Germany, have planned to raise the eligibility age for 
pensions and gradually to do away with the attractive arrangements that opened 
the early gateway too wide (Jacobs et al 1991). In line with Quinn & 
Burkhauser’s explanation (1990), these pension system reforms may very well 
reverse the early exit trend. 

Despite its impact on policy-makers’ decisions, the foregoing interpretation 
has come under intense criticism from scholars. Its paradigm is that the 
individual’s choice between work and leisure in the later years of life is 
determined by the advantages and disadvantages implicit in institutional 
arrangements for retirement. Four major criticisms will be discussed. 

First of all, the assumption that early exit results from a personal, rational 
choice is not corroborated in many countries. In France and the United 
Kingdom, 55—64-year-old wage earners who have been forced out of the labor 
market think of themselves as “discouraged workers” with no hope of finding 
work; they do not at all see themselves as retirees (Casey & Laczko 1989, 
Guillemard 1989, 1991a,b, Walker 1985). Furthermore, in many countries 
(including Germany), the rising percentage of the long-term unemployed over 
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50 years old proves that early exit is imposed rather than chosen, even though 
those concerned may not be dissatisfied with their lot. According to 
Abel-Smith (1992: 257), who has emphasized the impact of the early 
retirement programs implemented by American firms, “Leaving the labor 
force has been forced on workers by employers shedding labour.” All this 
evidence points to the labor market as a major determinant of early exit (See 
section on Changes in the Labor Market and Early Exit). 

Secondly, many scholars are skeptical about the validity of the research 
that shores up the individualistic pull factor interpretation. The latter’s 
proponents are accused of neglecting changes in the labor market, specifically 
in forms of employment, a topic to be treated in the following section. In the 
introduction to his book about aging wage-earners in a changing labor market, 
Doeringer (1990: 16) has written: "There is a substantial literature suggesting 
that the Social Security system and private pensions have lessened the 
attachment of older persons to the labor market. Although the evidence of 
diminishing attachment is clear, the link to pensions is much less certain." 
For Sheppard (1990) too, pensions alone do not explain the drop in older 
wage-earners' labor force participation rates, because the early exit trend has 
come from companies' efforts to get rid of ageing workers and public 
authorities' eagerness to solve employment problems. 

A third criticism has pointed out that the individualistic pull factor 
interpretation goes along with a fragmented view of the welfare systems 
whereon individuals base their choices. Hatzfeld (1971), Graebner (1980), 
Quadagno (1988), Schultz & Myles (1990), and Schultz (1985 and 1991) 
have studied the historical functions of retirement systems. Dealing with the 
attraction exercised on individuals by private and public pension systems, 
these studies have shown that retirement has been a means not only of 
redistributing income but also of labor management and control as well as of 
social stratification. At a macroeconomic level, company heads and public 
authorities constantly use these means in policy-making, in particular to deal 
with labor redundancy or unemployment. This observation calls for a global 
interpretation that takes into account the welfare state's internal rationale, 
interactions between welfare policies, employment policies, and retirement 
policies, and the links between social policies and conditions in the labor 
market. 

A fourth criticism is that, since the early exit trend has affected nearly all 
developed countries, it calls for a cross-national explanation. The individual- 
istic push-pull factor analysis has been worked out for the American case 
alone. Since it has no "international comparative design," it is unable to 
provide anything more than ad hoc explanations, which cannot explain such 
a widespread phenomenon. 
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“Political Economy” Approach to Social Protection and 
Early Exit 


"Political economy" can be defined as “the study of interrelationships between 
economy and society, or more specifically, the reciprocal influences among 
government, the economy, social classes, strata and status groups" (Walton 
1979: 9). This global, structural, cross-national perspective proposes a new 
understanding of the early exit trend. In this vein, two sorts of research have 
been conducted on early exit: on the one hand, global, comparative studies 
of the impact of welfare state institutions on this trend and, on the other, 
studies about policy-making and its social dynamics. 


A Global Line of Inquiry 


As comparative international studies have shown, most "institutional path- 
ways" for early exit have been opened through modifications in public or 
private welfare programs (Casey & Bruche 1983; Kohli et al 1991). In most 
European countries, unemployment and disability funds, as well as old-age 
funds, have served as early exit pathways between withdrawal from the labor 
market and entry into a retirement pension system. In countries where such 
programs are limited, pathways have been opened under firms' private 
arrangements. In the United States for instance, these pathways have passed 
through early retirement incentive programs (ERIPs) and company pension 
plans. This global approach to welfare institutions enables us to see beyond 
the push/pull perspective that is centered on the individual's decision to stop 
working. 

Welfare institutions' "pull" toward early exit trend cannot be assessed by 
looking at retirement pension systems alone. Unemployment and disability 
funds have also played a part by providing income maintenance programs. 
Moreover, public authorities in some countries have long pursued active 
employment policies. For instance, Sweden's public manpower programs 
provide ongoing opportunities for job training and reclassification to the aging 
and handicapped in particular. Such programs, which provide incentives for 
staying in the labor market, have to be included in any global assessment. 
They help explain why Sweden has more or less withstood the early exit trend 
(see Figure 1). 

To assess the impact of social protection on early exit, the whole, complex, 
interactive set of public as well as private welfare programs must be taken 
into consideration. 

Calcoen & Greiner (1989), Guillemard (1991a), and Guillemard & Van 
Gunsteren (1991) have emphasized the importance of a global approach to 
welfare institutions. They have shown that the new intermediate programs 
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between activity and retirement, which made early withdrawal possible, shift 
the boundaries between risks within social protection systems. For 
Guillemard, the welfare state’s internal architecture has been modified in two 
major ways. 

First of all, public retirement systems have lost their major role of regulating 
definitive withdrawal. Out of the six countries in Guillemard & Van 
Gunsteren’s comparative study, the retirement system has maintained this role 
only in Sweden, thanks to a scheme for part-time retirement after 60. In 
Germany, France, the Netherlands, United Kingdom, and United States, many 
wage-earners—in fact, about 70% in the first three countries—definitively 
stop working well before receiving a public old-age pension. The eligibility 
requirements (age and number of years worked) for such pensions no longer 
fix the moment of definitive exit. Few wage-earners stil] follow what used to 
be a nearly universal model of a direct transition from full-time employment 
to full-time economic inactivity on a pension. To cite the German example, 
35% of admissions into the public retirement system in 1987 came directly 
from the labor force whereas 40% had previously been on disability and 11% 
on unemployment (Jacobs et al 1991). 

Secondly, disability or unemployment funds have been taking the place of 
old-age funds in regulating definitive withdrawal. Their initial objectives have 
been modified so as to cover new "risks." For instance, unemployment funds, 
set up to provide compensation for a limited time to wage-earners who lose 
their jobs, now have to serve benefits for five years or more to aging workers 
with no hope of finding jobs. These funds' eligibility requirements and 
compensation rates are not consistent with this new assignment as a pseudo 
old-age fund. In France, more than 5096 of beneficianes from 1982 to 1984 
of unemployment compensation were over 55 years old. In the Netherlands, 
Germany, and Sweden, the boundary between the risks covered by the 
unemployment and disability funds has blurred (This 1s somewhat true of the 
United Kingdom too). Since the risk of disability has been expanded to cover 
unemployment, the impossibility of finding a job in the local labor market 
has become proof of a loss of fitness for work (Piachaud 1986, Blomsma & 
de Vroom 1991, Jacobs et al 1991, and Wadensj6 1991). In these three 
countries, this confusion between risks has been embedded in legislation. 
Old-age funds, too, have been used by governments in drawing up employ- 
ment policy. This was what happened in France when, in 1983, the retirement 
age was lowered to 60 (Guillemard 1986a, 1991b). Tbe "risks" of unemploy- 
ment, disability, and old-age have been modified, along with the notion of 
retirement. In several countries, this remodeling of social protection has led 
to confusion between risks and forms of coverage. As a consequence, 
controlling social expenditures has become impossible. 

The very malleability of institutional early exit arrangements has augmented 
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the upheaval in welfare systems. As a consequence, these arrangements have 
been continually modified. To take but one example, most preretirement 
schemes—introduced to deal with the employment situation—were in effect 
for a limited period only. In Germany, the program set up under the 1984 
Preretirement Act was suspended in late 1988. The United Kingdom’s Job 
Release Scheme was abolished in 1989. In France, Preretirement Solidarity 
contracts were no longer signed after 1985, and eligibility requirements were 
tightened under other preretirement programs. For Casey (1989), this rework- 
ing of early exit arrangements has come out of a complex dynamics between 
"instrument substitution" and "cost shifting.” 

The analysis of early exit determinants cannot be limited to old-age pension 
systems alone. The whole system of social protection, private as well as 
public, has abetted this trend. Tracy & Adams' (1990) study of the age of 
"first pension award under social security" 1n eight countries illustrates the 
limits of an approach focusing on pensions alone. The major institutional 
components of the welfare system that encourage early exit must be 
investigated. 

In their enlightening demonstration of the explanatory power of a global, 
structural, cross-national analysis of how "welfare states' regimes" have 
affected early exit, Esping-Andersen & Sonnberger (1991: 232) sought to 
understand "how different kinds of Welfare States and different kinds of labor 
markets affect elderly male labor force participation." This research is based 
on time series data about Germany, the United States, and Sweden and on 
information about modifications of welfare programs there from 1965 to 1987. 
Previously, Esping-Andersen (1990) had worked out a theoretical explanation 
about these three "welfare regimes." His 1991 publication uses two labor 
market indicators: the unemployment rate (an indicator criticized by Rein & 
Jacobs 1992, see section on Changes in the Labor Market and Early Exit) 
among men from 55 to 64 years old and the number of jobs lost per year in 
manufacturing. The different combination of push and pull factors explains, 
for each welfare state regime, the impact on early exit. In Germany, push 
factors have come into play as the number of jobs has increased moderately 
and government policy has not done much to provide elderly workers with 
job opportunities. In this context, social legislation made early exit “feasible 
and attractive." Hence, pull factors were at work there too. The German case 
is more or less typical of northern Europe (outside the United Kingdom, as 
will be shown). In Sweden too, legislation made early exit attractive, but this 
pull was counterbalanced by social policies that offered older wage-earners 
opportunities for staying in the labor market. Furthermore, no push factor was 
at work there. This explains why Sweden is unique in that older wage-earners' 
activity rates have decreased very little. Contrary to push/pull factor theorists, 
Esping-Anderson concludes that pull factors have had a rather limited effect 
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in the United States. Unlike Sweden, the United States has not offered older 
wage-earners alternatives to early exit whenever push factors—the plunging 
labor market—come into play. This research shows how important it is to 
place the early exit trend in the nexus of welfare state policies and labor 
markets. It also demonstrates the necessity of viewing it from a cross-national 


perspective. 


The Politics of Welfare Policies as an Explanation of Early 
Exit 

This line of inquiry runs counter to the theory that early exit results from 
individual choice. It implies that individuals' behaviors are shaped by social 
forces and capitalistic structures. Such research generally makes a sociohiston- 
cal analysis of the behaviors and strategies of the major group actors; the 
outcome of their conflicts and negotiations are interpreted in terms of welfare 
systems. Studies on aging and retirement policies have been carried out from 
this perspective by Estes (1983, 1991), Guillemard (1983, 1986b), Hendricks 
& Calasant: (1986), Hendrick & McAllister (1983), Myles (1984), Myles & 
Teichroew (1991), Olson (1982), Phillipson & Walker (1986), Quadagno 
(1984, 1988), Estes & Minkler (1984, 1991), Walker (1991) and Townsend 
(1986). Fewer studies have been made about early exit itself. 

Guillemard & Van Gunsteren (1991:366—69) have drawn the following 
conclusions from their comparative study of early exit policies in France, 
Germany, Great Britain, the Netherlands, Sweden, and the United States. 
Until recently, most intermediate programs between full-time work and 
full-time retirement arose out of an implicit agreement that labor unions, 
employers, and the state had reached in a context of recession and rising 
unemployment. Each actor had its reasons for supporting this agreement. For 
unions, early exit would free wage-earners with long careers from work while 
transferring jobs to young people. For employers, it provided a means of 
trimming the work force, reorganizing production, and making personnel 
management more flexible. For public authorities, these intermediate pro- 
grams would reduce the supply of (older) labor while providing coverage to 
the victims, who would not, therefore, be counted as unemployed. Govern- 
ments expected that such programs would help improve the employment 
situation by the early 1980s as new opportunities were provided to jobless 
young people. 

Employers, however, played the leading role in opening early exit 
pathways. In the Netherlands, companies, especially when in difficulty, 
helped make the disability fund into such a pathway. In France, employers 
took the lead by proposing the first guaranteed income scheme for aging, 
dismissed workers. Because this scheme's benefits depended on dismissal, 
employers maintained full control over early exit while passing the costs onto 
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the unemployment compensation fund (Guillemard 1985). The employers’ 
role was even clearer in the United States, where public unemployment, 
retirement, and disability insurance systems could not be so readily adjusted 
for early exit. Companies used private pension funds to attract older employees 
toward early exit, despite the 1986 amendment to the Age Discrimination of 
Employment Act. Unable to openly practice age discrimination, they set up 
Early Retirement Incentive Programs (ERIP) financed through their private 
retirement funds. This made early exit more attractive (Hardy et al 1993). In 
most countries, employers have continually sought to use public as well as 
private programs to trim the aging work force. If necessary, they have added 
incentives by providing severance pay (in Sweden and France) or by making 
company pension funds (in the United States and Great Britain) serve full 
benefits to early exiters without any actuarial reduction of pensions due to 
early exit. Even in Sweden, firms have bypassed job security and integration 
programs so as to obtain more flexibility 1n managing the older work force. 

The social dynamics of the agreement among labor unions, employer 
organizations, and the state about the early withdrawal of aging wage-earners 
has been losing force since the late 1980s. It is difficult to foresee how these 
forces will realign and what fate will be meted out to aging workers. For 
employers, getting rid of older wage-earners is no longer a panacea. Some 
firms have even announced their intention to “make something new with the 
old," given demographic aging and the scarcity of young people with 
diplomas. Nonetheless, personnel practices do not seem to have changed much 
as a consequence, and firms are still eager to dismiss older workers. 

After promoting early exit to cope with unemployment and then to open 
jobs for young people, public authorities are increasingly refusing to fund 
such arrangements. They are restricting and even terminating so-called 
preretirement programs. Given the prospects of demographic aging in highly 
industrialized countries, the major stake in public actions has become the 
long-term containment of the costs of old-age funds. Public authorities are 
now mainly concerned with preventing the collapse of these funds. To this 
end, they have launched three sorts of actions. 

First, public pension systems are being reformed. The retirement age has 
already been raised in Germany, the United States (Sheppard 1991), and Italy; 
and it is likely to be in France too. Another major reform is the turn toward 
flexible retirement systems. Partial retirement has been introduced with limited 
success in Germany and France; and flexible retirement has been proposed in 
Great Britain to cover the "retirement decade between the ages of 60 and 70 
(Laczko & Phillipson 1991). 

Secondly, given the exponential growth of social programs, governments 
are withdrawing financial support from early exit and trying to shift the costs 
to semipublic or private arrangements. Jn the United States (Hardy et al 1993), 
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Great Britain (Walker 1991, Laczko & Phillipson 1991), and Canada (Myles 
& Teichroew 1991) this strategy is evident; and it is being pursued more 
discreetly in some other European countries. The private-public “welfare mix” 
being concocted differs from the old economic liberalism. Roles are being 
redistributed so that the welfare state, which has protected citizens or wage 
earners from vicissitudes in the labor market, becomes a minimal welfare 
state supplemented by a “corporate welfare state that sustains capital produc- 
tion not only through programs of income maintenance directed toward 
workers as beneficiaries but also through direct subsidization and regulation 
of corporate policy” (Hardy et al 1993: 22). 

This tendency exists even in welfare states with broad, excellent coverage, 
as in the Netherlands. The Dutch company-financed VUT (early retirement) 
programs accounted for nearly half of early exits in 1988 as compared with 
9% in 1980. Compare this to what has happened there to a well-trodden early 
exit pathway: the public disability fund accounted for 78% of early exits in 
1980 but for 40% nowadays (Aarts & de Jong 1993: 33). 

Thirdly, the state is seeking to make employers and labor unions respon- 
sible, to make them realize that financial constraints and demographic 
prospects necessitate reforming the social programs that come out of collective 
bargaining. In France for example, the Ministry of Labor has reprimanded 
firms for systematically eliminating older wage-earners and asked them to 
implement active employment policies (training, reclassification, etc) with 
regard to this population. Meanwhile, firms are being made to pay a larger 
share of early exit costs (Guillemard 1991b). 

For the already mentioned researchers who have studied welfare policies, 
the compromises underlying early exit arrangements also determine 
beneficiaries’ behaviors and their social status. These persons have very little 
free choice. Their fate is molded by the social forces whose interests 
predominate in compromises. Unless effectively conveyed by labor organiza- 
tions, their interests are seldom taken into account. The early exit trend is thus 
seen as marginalizing older wage-earners 1n response to companies’ short-term 
needs. Its future mainly depends on what firms will do in the middle run. 

National differences with regard to the fate of early exiters result: from the 
different ways governments cope with joblessness and employment problems; 
from variations in coverage under public welfare programs; and from the way 
social protection is managed (whether by the state or by corporations). Some 
British researchers have pushed the marginalization argument toward a nearly 
Marxist view of the state as an instrument in the hands of the ruling class. 
Accordingly, the state "has used older workers as a reserve army of labor to 
be ejected or encouraged to stay at work according to perceptions of 
demographic changes and the likely supply of younger workers" (Laczko & 
Phillipson 1991: 244). 
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Walker (1983, 1986, 1991) argued that the capitalist state has, through 
welfare programs, assigned the elderly a dependent socioeconomic status. 
This explanation, though plausible as regards structured dependency, is 
excessively deterministic (Guillemard 1986b,c, 1991c; for a criticism, see 
Johnson et al 1989). 

Research has criticized the dualism produced by retirement and preretire- 
ment arrangements, which divide beneficiaries into two distinct groups. 
Atkinson & Sutherland (1993) have taken up Titmuss' (1955) assertion to 
refer to "two nations in early retirement" in Great Britain: those forced to live 
on welfare and those receiving occupational pensions. This dualism does not 
seem to be spreading to the rest of Europe, but it may soon do so if public 
authorities are determined to curb social expenditures and leave firms free to 
manage as they please older wage-earners. 


CHANGES IN THE LABOR MARKET AND EARLY EXIT 


How is early exit related to the labor market? Analysts of the labor market 
interpret the issue differently than those who study pension policy. As shown, 
the latter usually examine the aging of the population and suggest major 
adjustments in pension levels, contribution rates, or the retirement age. In 
contrast, the former focus on the impact of aging on average productivity 
levels and the aggregate wage bill. They emphasize the economic value of 
early exit policies for firms (Blanchet 1993). 

This review of studies about the impact of labor market dynamics on 
definitive withdrawal from employment shows how productivity gains, 
unemployment, restructured employment relations and age productivity pro- 
files—operating through different mechanisms——have all reinforced the early 
exit trend. One argument is grounded in the secular trend that has enabled 
more and more workers to leave the labor force earlier. Two other arguments 
have to do with industrial policy. A fourth views early exit as the macroeco- 
nomic outcome of the rational firm's microeconomic decisions about industrial 
restructuring or its response to older workers’ declining productivity. All these 
arguments lead to the same conclusion: early definitive withdrawal is 
economically beneficial to firms. The questions raised about each argument's 
empirical and theoretical assumptions are also reviewed. 


Productivity Gains 


Industria! growth, fueled by rapid technological change, has dramatically 
increased output, while labor input has remained steady. These productivity 
gains can be allocated in three ways: increases in wages, decreases in hours 
worked, or a shorter working life. 

In nearly all industrialized countries, the long-term result of productivity 
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gains has been a lesser need to work in later years, whence a new 
phenomenon—retirement. As Donahue et al (1960: 331) have stated: "Retire- 
ment is a phenomenon of modern industrial society... The older people of 
previous societies were not retired persons; there was no retirement role.” 
Productivity gains were the major economic factor underlying the emergence 
of a clear boundary separating work from retirement. In most Western 
economies, the retirement age was set at about 65. In the 1950s and then 
again in the 1970s, several countries began either lowering the retirement age 
or making flexible retirement possible. These legislative changes were 
principally motivated by the desire to spread the benefits of postwar economic 
growth. For example, in 1956, the United States introduced early retirement 
at 62 for women only. A working wife and a retired husband would, it was 
argued, disturb traditional family values. Under flexible retirement at age 63, 
which Germany instituted in 1972, eligible individuals could retire two years 
earlier than normal without financial penalty. In 1976, Sweden lowered the 
retirement age from 67 to 65. 

This productivity gain argument has been generally accepted as an 
explanation of the century-old trend toward a shortened working life. 


Unemployment 


During the 1970s, early retirement took a new turn as the oil crisis hampered 
labor demand. Persistently high jobless rates resulted. For Europe, this was 
a new phenomenon: unemployment soared from a stable low (1% to 3%) up 
to between 7% and 10%. Growing unemployment has accelerated the trend 
toward early retirement. Recall that, in the United States (and many other 
countries too), social security was set up in response to the Great Depression 
of the 1930s. Graebner (1980: 167) observed, “Retirement .. succeeded where 
work-sharing programs did not, largely because it seemed to offer a relatively 
inexpensive method of reducing unemployment.” Likewise, preretirement 
programs in Europe have been set up in response to rising unemployment. 
Preretirement legislation was an ideal stopgap measure, which could be 
removed when unemployment declined. In France, Germany, and the United 
Kingdom, preretirement programs were, not surprisingly, terminated with 
economic recovery, even though unemployment rates stayed high. 

Along with these temporary preretirement programs, firms created private 
intermediary “pathways” to span the period between exit from work and entry 
into the public old-age pension system. Individuals were encouraged to use 
them as well as long-term sickness benefits, disability schemes, and unem- 
ployment insurance as de facto pension funds (Kohli et al 1991). Putting an 
end to such practices has turned out to be much more difficult than terminating 
preretirement programs. 

Some countries also lowered the retirement age. In 1983, France responded 
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to high unemployment by reducing the admission age for a full retirement 
pension from 65 to 60 for wage-earners who had contributed at least 37.5 
years to the Old-Age Fund. This policy helped relieve financial pressure on 
the Unemployment Compensation Fund, which had been providing generous 
benefits to older unemployed workers under guaranteed income schemes. In 
effect, old-age pensions replaced a lower percentage of former wages than 
these schemes (Guillemard 1991b: 146). 

In 1962, the American Congress extended to men the opportunity, 
previously granted to women, to retire at 62. The economic justification for 
this measure was the plight of older unemployed workers who had difficulty 
finding new jobs. It was argued that these men should be able to go on 
retirement earlier than usual. According to Schultz et al who see early exit 
as a functional “handmaiden of unemployment policy” (1991: 138), “Almost 
immediately more than half of new old-age pension beneficiaries exercised 
the option, and the number retiring early has increased steadily over the years" 
(1991: 134). 

Given the strong institutional response to unemployment, some researchers 
have found empirical evidence linking persistently high unemployment to 
older men's decreasing labor force participation. Casey & Wood (1993) 
maintain that unemployment and British men's early retirement from 1979 to 
1986 were closely correlated. They cite evidence that a 196 rise in unemploy- 
ment results in a 996 fall in 60—64 year-old men's participation rate. However, 
other analysts suggest at most an indirect linkage between the employment 
situation and early exit. 

Sorensen (1992) has shown that retirement, defined as leaving the labor 
force and no longer looking for work, is linked to labor market structures 
through the employment contract. He identifies three types of contracts: for 
single tasks, for jobs, and careers. He argues, "The shorter and more open 
the employment relationship, the more closely performance variation will be 
related to the wage rate .. the less closely performance variation is linked to 
earnings, the more likely it will be that retirement will be forced upon the 
employee" (Sorensen 1992: 5). The job structure influences how retirement 
and unemployment are linked: “High unemployment may trigger much earlier 
retirement in countries where closed employment tends to predominate while 
having little impact where employment relations are more open" (Sorensen 
1992: 18). 

The close link in Great Britain between unemployment and retirement may 
be more a function of open employment relations rather than of unemployment 
per se. 

For Naschold & de Vroom (1993) as well, the link between unemployment 
and early exit may be much weaker than is widely believed. Drawing on the 
German and Dutch experiences, they argue, "it can be illustrated that the 
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pre-retirement trend already started before the collapse of the labour market 
and has since then become a more or less relatively autonomous trend, which 
continued when there was a revival of the labour market. Of course, we cannot 
deny that developments in the labour market have had a substantial acceler- 
ation effect, however not an initiating one" (Naschold & de Vroom forthcom- 
ing: 3). 

Unemployment cannot explain the early exit trend in all countries. As Aaron 
(1993 210) points out: "When a fundamental social and economic change 
occurs in many countries at about the same time, one should entertain the 
possibility that something systematic is going on. It may be .. some common 
reaction to rising incomes may be occurring everywhere." 

Unemployment, therefore, may be a misleading index of the macroeco- 
nomic forces that generate early exit. If Sorensen's analysis of job structures 
is correct, many factors underlying the early exit trend are likely to be country 
specific, with institutional factors playing a key role. In most of continental 
Europe, older workers on long-term unemployment are admitted early into 
social security's pension system. In countries like the United States, no such 
institutional coverage is available. In Japan, a worker who reaches the 
retirement age of 60 has the right to 300 days of unemployment insurance 
because, after a lifetime of employment, he is entitled to a period of rest 
before resuming work. Both increasing affluence and productivity gains as 
well as unemployment rates are in accounting for the early exit trend. 


Restructuring the Employment Relationship 


Some scholars see evidence of a change from continuous, long-term employ- 
ment to contingent, or market-mediated, employment relationships character- 
ized by short-term, part-time or temporary contracts. 

For Abraham (1990), this restructuring of the employment relationship 
marginalizes workers because employers no longer have a long-term commit- 
ment to them. Workers become less secure as companies try to hedge against 
a volatile business cycle. 

For Standing (1986) too, increasing "flexibility" in labor management is 
marginalizing older workers. Marginalization occurs when full-time secure 
employment is replaced with a flexible form of labor in the context of a 
massive labor surplus: "In effect, large-scale horizontally integrated job 
structures have been giving way to more loosely linked systems in which 
promotion based on seniority, experience and length of service is more limited 
and where a higher premium is placed on flexibility and adaptability" 
(Standing 1986: 335-36). In this type of industrial restructuring, older 
unskiled workers, once they lose their jobs, have a hard time finding new 
ones. Contrary to this, Hutchens (1993) in fact points out that the most 
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systematic policies of discouraging older workers target the most privileged 
among them. 

Another possibility is that the restructuring of jobs in the American economy 
contributes to both early exit and expanded job opportunities for older workers. 
Early exit occurs, as Standing has observed, when firms dissolve older 
workers’ long-term contracts. The means of undoing these obligations range 
from outright dismissal to early retirement incentives supplemented by 
attractive one-time “windows of opportunity” plans (mainly through the 
forementioned ERIPs). 

But employers, given their preference for short- rather than long-term 
employment relationships, might also be more willing to hire older workers. 
The latter can be attractive if they combine two qualities: occupational 
know-how and lessened wage demands (because of supplemental sources of 
income from social security or private pension funds). This alternative 
explanation is compelling if long-term contracts are an important barrier to 
the employment of older workers. 

However, the growth of contingent work might be driven by a very different 
process, namely, larger firms seeking to subcontract work so as to ensure 
product reliability and gain access to skills and expertise. Jobs in subcontracted 
companies might be increasingly secure. Instead of increasing, marginaliza- 
tion may be shifting from small to large firms. 

Disentangling these alternative explanations is difficult in an economy 
undergoing a short-term recession. In an economic slump, few workers of 
any age are hired. In the short run, one can only observe the termination of 
employment relationships. Friedman (1992) has carefully analyzed whether 
the changing economy can generate enough jobs for mature workers. 


Age-Productivity Profiles 


Age-wage profiles are the empirical basis of the economic theories that link 
productivity to age. Those firms that use internal labor markets to allocate 
employees to jobs rely upon promotions and wage hikes as motivational 
devices. Promotions make the age-wage profile slope upwards. If, owing to 
combined on-the-job training and experience, workers increase productivity 
throughout their careers, then the upwardly sloping age-wage profile matches 
productivity increases. But, seniority effects may also come into play 
(Blanchet 1992: 16), i.e., “an excess of wages over productivity at the last 
phase of the life cycle.” This is Lazear’s (1979) well-known hypothesis that 
wages tend to be higher than productivity at the start of careers and lower at 
the end. But Doeringer (1990) has pointed out that research on the older 
workers’ capacities and performance has been neglected. 

Blanchet reports only one highly significant effect of aging: a change in 
the “balance between a firm’s aggregate production and its wage bill when a 
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seniority rule is widely applied. In this case, "aging may imply large financial 
disequilibrium for firms and may encourage them to rely on early retirement 
(Blanchet 1993: 16). 

In a thoughtful paper, Sorensen (1992: 17) points out: "Firms providing 
employment contracts covering whole career trajectories have strong incen- 
tives to control retirement rates." According to him, the main socially 
acceptable means firms have to manipulate the retirement process are private 
pensions, mandatory retirement or a combination of both. 

Age-productivity theories of early retirement have to withstand two critical 
tests. First, the assumption of a declining productivity profile has to be 
empirically corroborated. Second, it must be proven that firms actually 
manage labor forces by incorporating early exit into their strategies. 

Is there empirical evidence of an excess of wages over productivity at the 
end of careers? Might wage growth over the life cycle be a consequence of 
productivity growth? For Hutchens (1993: 45), the hypothesis that "older 
workers are overpaid in the sense that their wage lies above their marginal 
productivity is only speculation. At present, we do not have sufficient evidence 
to draw strong conclusions." Having extensively analyzed American com- 
panies' pension policies, Weiss (1992: 24) argues, "I believe that the typical 
plan was established without much attention to the incentive effects that it 
might create... I believe that for the most part the important incentive effects 
of the plans were only emphasized after the fact, primarily by economists; 
the plans were not typically designed with these incentive effects in mind." 
Economists do not agree about the significance of the seniority effect. For a 
review of the evidence, see Straka (1992). 

Sociologists have adopted a somewhat different view of the links between 
age and productivity. They have redefined productivity in terms of the 
externalization and internalization of labor market costs. Naschold & de 
Vroom have recently completed a study of firms' strategic behavior with 
regard to the exit process in seven countries and six different industrial sectors. 
This study of small and large firms, firms in the public and private sectors, 
and firms in growing and declining industries sought to identify the institu- 
tional factors underlying a company's tendency to externalize early exit costs 
by shifting them elsewhere, particularly onto the state. These researchers have 
also identified several policies sponsored by firms, as in Japan, for integrating 
older employees. They found state-sponsored integration policies only in East 
Germany and Sweden. 

Naschold & de Vroom studied the hypothesis that what determines the 
firm's decision to externalize or integrate labor market costs depends on its 
production regime and interactions with the state. They suggest that, in each 
country, firms' production regimes are "mediated" by firm-state relationships. 
Three types of such relationships are identified: the state dominates the 
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production regime; the state 1s disconnected from it; and the state and firm 
reciprocally influence each other. Especially in systems where the firm-spe- 
cific production regime is determined by external structures and regulations, 
“the strategic room for firms to produce their own firm-specific age profile 
seems to be very low” (Naschold & de Vroom 1993: 17). 

Guillemard (forthcoming), director of a research project under the auspices 
of the French Ministry of Labor, is developing a national typology of firms’ 
strategic behavior toward older employees. A national survey of firms was 
completed in the summer of 1992, and the preliminary results of detailed case 
studies point to labor market structures and employment contracts as deter- 
minants of early exit. 


INTERACTIONS BETWEEN THE LIFE COURSE, THE 
WELFARE STATE, AND LABOR MARKETS AS A 
COMPREHENSIVE INTERPRETATION 


A new research perspective focuses on the interrelations between the life 
course and welfare state as an explanation of the early exit trend. Though still 
few in number, these studies open toward a global, macrosociological 
explanation that takes into account political economy’s socioeconomic factors 
as well as “social morphology” (Halbwachs 1970). 

The opening of this new perspective was preceded by the renaissance of 
the life-course as a theme in sociology at the initiative of scholars such as 
Riley et al (1972). More recently, other scholars have helped to conceptually 
clarify interactions between the state and life course. Mayer & Schoepflin 
(1989) review research on how the welfare state has helped construct the life 
course: “whichever direction the future development of the welfare state may 
take, state activity will have a large effect on shaping individual lives and the 
social structures of the life course” (Mayer & Schoepflin 1989: 189). 

As several historians and sociologists have pointed out (Aries 1973, 
Anderson 1985, Graebner 1980, Hareven 1982, 1986, Mayer & Müller 1986, 
Kohli 1986, 1987), the institutionalization of the life course is a major way 
the welfare state has shaped individuals’ lives. The building of the welfare 
state has resulted in formal, universal rules, which set the eligibility 
requirements for coverage against certain risks or the provision of certain 
services. The principal rule has been the chronological criterion of age. Owing 
to the century-long development of welfare systems, the life course has 
gradually become an institution that organizes the individual's life span into 
a series of successive positions conditioning his livelihood, his plans, and the 
symbolic reference marks used to make personal plans. 

Kohli (1987) has demonstrated the decisive part played by the German 
retirement system in constructing and consolidating the tripartite life-course 
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model that has come along with industrial society. For one thing, retirement 
systems have helped establish a hierarchy between the three major stages. 
The central stage of adulthood corresponds to the working life. It is surrounded 
by youth, devoted to education as a preparation for work, and old age, 
associated with inactivity. Retirement systems have placed the time to rest at 
the end of life and subjected it to the requirement that the individual contributes 
to production during his working years. Along with other social policies 
(notably education), they have used chronological criteria to mark the 
thresholds between stages in the life course. Just as the obligatory ages for 
attending and leaving school mark childhood and adolescence, the age of 
entitlement to a full retirement pension marks entry into old age. 

This “chronologization” has entailed standardizing the life course. Everyone 
passes in a foreseeable manner from one stage to another at the same 
chronological age. Entry into the world of work occurs at the same age for 
each level of education. Definitive withdrawal from work occurs at the same 
age for everyone. As retirement systems have been extended, the withdrawal 
age has been fixed as a function of the age of entitlement to a full retirement 
pension. The ages of definitive withdrawal have tended to reduce their 
dispersion. The timing and arrangement of stages in the life course have been 
increasingly standardized, and individuals’ behaviors, normalized. 

Kohli’s most original finding has to do with how retirement systems, 
because of their part in institutionalizing the life course, socialize and exercise 
control over people. He refers to this social control as the “moral economy,” 
a phrase borrowed from Thompson (1971). Owing to retirement systems, the 
life course is now seen as a continuous, ordered movement of individuals, 
the basic units of modem society. Having introduced specific forms of 
individualized social control, this new “life-course regime" fulfilled the 
requirements of what was becoming industrial society: new collective strate- 
gies with regard to time, collective and statistical (instead of family) solidarity, 
and a work ethic. 

These articles propose a global, comprehensive analysis that incorporates 
the cultural and symbolic aspects of the life course into the classical trilogy 
of political, economic, and social factors. A collective project (Kohli et al 
1991) that inquired into the meaning of early exit in several industrialized 
countries has demonstrated the heuristic value of this theoretical perspective, 
even though certain conclusions are open to debate. A comparative analysis 
(Guillemard 1989, 1991a, Guillemard & Van Gunsteren 1991) of these data 
has interpreted the significance of how early exit has reorganized the transition 
from work to nonwork. 

One might presume that rapidly lowering the definitive withdrawal age has 
resulted from an acceleration of the century-long trend for the elderly to stop 
working (See section on Massive Early Exit). Accordingly, the timing of 
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retirement has changed, but the life course’s tripartite organization has not 
been affected: early exit is symmetrical to young people’s later entry into the 
world of work, and the time of work 1s being concentrated on a narrower 
age-group. In this case, work still determines how the life course is socially 
organized. The chronological markers of the three stages, though modified, 
are still fixed in relation to entry into and withdrawal from the world of work. 
The meanings of work and retirement remain unchanged. Hence, changes in 
definitive withdrawal ultimately represent a change in the timing of retirement. 
The institutionalization and standardization of the life course have been 
maintained. 

But another interpretation is possible, based on evidence of a dein- 
stitutionalization of the life course. Guillemard has worked out this interpre- 
tation. Reorganizing the threshold for definitive exit is not so much a 
consequence of applying the logic of retirement to younger workers and thus 
rescheduling the life course as it is part of a deep process reshaping the 
relationship between the welfare system and the life course. This change 
threatens the tripartite model that emerged with industnalization. Several 
aspects of this process call for attention. First of all, is the transition from 
work to retirement stil] standardized, or is it being individualized? Has early 
exit made the lıfe course more flexible? If so, stages in the life course will 
no longer follow each other in the uniform, foreseeable order set by the 
tripartite model, wherein everyone had a place in a succession of hierarchically 
arranged stages with clearly marked bounds. Secondly, does the individual 
still control his own definitive exit, as he used to do when he could decide, 
once eligible, whether or not to go on retirement? Thirdly, have the boundary 
markers between work and nonwork changed? The eligibility age for a full 
retirement pension used to be the milestone signaling entry into old age. Are 
markers other than chronological ones now being used to stake out the life 
course? 

New institutional welfare arrangements, with their eligibility requirements, 
which have authorized shifting the boundary between work and nonwork, 
provide evidence that the massive early exit trend is socially reconstructing 
the life course and old age. By comparing these arrangements, Guillemard 
has raised theoretical questions about the social redefinition of old age, the 
distribution of work and free time over the life course, and the latter’s overall 
reorganization. She argues that the early exit trend undoes the close linkage 
between the retirement system and the life course’s tripartite model. New 
means of social protection have taken the retirement system’s place in 
regulating definitive withdrawal. The chronological milestones marking the 
life course are being torn up, and functional criteria based on the individual’s 
capabilities and efficiency are increasingly being used to stake out the end of 
careers. This 1s especially noticeable when, instead of public retirement funds, 
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disability insurance or unemployment compensation regulate early exit. 
Taking away the chronological organization of the life course entails 
destandardization. There is no longer a clear, orderly transition from full-time 
work to full-time, definitive inactivity. Entries into and exits from the labor 
force follow each other, and no set order seems fixed for ending a career and 
going on retirement (Burkhauser & Quinn 1990, Doeringer 1990, Bluestone 
et al 1990, Reimers & Honig 1989). "Bridge jobs” help span the period 
between these two events. Nowadays, no one working in the private sector 
knows at what age and under what conditions he will stop working. Retirement 
as a social status and as a system of redistribution no longer provides a clear 
and foreseeable pathway out of the labor force. Definitive withdrawal from 
the labor market in the later years of life has become variable, imprecise, and 
contingent. 

The standardized, normative life course is coming undone. This accounts 
for something other interpretations have left unexplained: that the early exit 
trend is lasting despite reforms in the institutional arrangements used as early 
exit pathways and despite changes in the labor market or improvements in 
the employment situation. This fact casts doubt upon explanations that rely 
exclusively on systems of social protection or the labor market. It also serves 
as evidence for considering the life course to be a major institution for 
socializing individuals, shaping their views as well as plans (hence their 
relations to time and to the future), and distributing work and nonwork over 
the hfe span. 

Some authors (Kohli 1991: 22, Paukert 1984) have come to a different 
conclusion. Though admitting that the transition toward retirement is "longer 
and fuzzier," they think that the tripartite model is still firmly established and 
that age is still the key criteria for organizing the life course. In an enlightening 
article, Henretta (1992: 265) proposes a conceptualization for reconciling these 
two types of interpretation: uniformity of life course transitions “distinguishing 
two facets: universality in event occurrence and variability in event timing." 
In other words, early withdrawal is becoming universal, but it is variable in 
form because of changes in institutions (e.g. the lesser role of public retirement 
funds) and in the timing of withdrawal or definitive exit from work. For 
Henretta, the standardization of the life course on an age-basis has not been 
undone, but it has become more complex to study. Further research should 
help settle this debate. 

The interpretation of early exit that highlights interactions between the life 
course and systems of social protection has exposed the need to change our 
conceptualization of retirement both as an institution and as a process or 
transition from one stage to another in the life course. By taking into account 
economic, social, and cultural factors, such a comprehensive interpretation 
enables us to grasp the interactions between fluctuations in the labor market, 
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modifications in systems of social protection, and changes in the ways 
individuals conceive of the distribution of work and nonwork throughout their 
lives and in the ways they view their future. 


THE FUTURE OF EARLY EXIT 


Our analysis has drawn attention to multiple facets of the policy controversy 
about early exit. In speculating about the future, we must keep in mind the 
perspectives of, respectively, the state, firms, and individuals. 

Concerned about pension costs, many governments are enacting, or trying 
to enact, legislation to stop or reverse the early exit trend. In the United States, 
the 1983 amendments to the Social Security Act, among other measures, 
called for gradually increasing the age of entitlement to a full old-age pension 
from 65 to 66 by 2009, and to 67 by 2027. Furthermore, a higher rate of 
reduction will be gradually applied to pensions drawn before this age. 
Germany’s 1992 Pension Reform Act followed the American example. By 
2001, the age limits for most social security pensions will be gradually 
increased to 65. In Japan, a first attempt to raise the retirement age from 60 
to 65 failed to pass parliament. In 1992, the Conservative government in 
Sweden unsuccessfully tried to abolish the costly system of partial retirement 
pensions. In 1990, the Swedish state did close down the early exit pathway 
through the disability fund. 

In the meantime, private early exit arrangements, especially in the form of 
severance pay, have expanded. Are private arrangements offsetting the state’s 
attempts to restrict early exit? 

For the state, demographic changes are reducing the ratio of social security 
contributors to beneficiaries, whence the pressure to raise contributions and 
lower benefits in order to remain fiscally viable. Raising the retirement age 
and providing work incentives to older persons are, therefore, primary policy 
options. The state’s “can’t afford” policies could gradually reverse the early 
exit trend. 

Another way of envisioning the future from a labor market perspective is 
to learn from the experiences of low early exit countries like Japan, Sweden, 
and, to a lesser extent, the United States. The primary lesson is that withdrawal 
from the labor force does not necessarily mean a dichotomous choice between 
working or not working. Early exit can be seen as a process that entails gradual 
redefining of work at the end of careers by lowering wages, reducing hours, 
or making new job assignments. Furthermore, an income package could be 
worked out so that wages plus public or private pension benefits offset any 
loss of earnings. The cooperation of firms must be obtained for this. Of course, 
they might pursue policies contrary to those of the state's, as in the United 
States where private pension funds encourage early exit whereas public policy 
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favors increasing the retirement age. In this situation, the cost of redefining 
work and repackaging income falls on the individual and his family. 

The experience of low early exit countries suggests that reorganizing work 
at the end of careers implies lowering older workers’ wages. This can be 
achieved in different ways. In the United States, it occurs via “bridging jobs” 
(Doeringer 1991); in Sweden, through partial retirement pensions; and in 
Japan, through social conventions that allow most large firms to lower wages 
after the mandatory retirement age of 60. This de facto adjustment of wages 
to age may help cope with the age-wage productivity gap. To make this 
adjustment “bloodlessly,” it would be necessary to combine public and private 
social protection to offset the wage decline. 

What is the individual’s perspective on early exit? The more the state and 
firms shift the costs of early exit onto individuals, the less attractive it will 
be. But cost-shifting is likely to occur after, rather than before, the individual 
has stopped working. In the United States for example, private pension funds 
usually cover early retirees’ medical costs. As a result, General Motors is 
now spending more on health care than on steel. Under pressure to reflect 
future financial commitments in its current budget and given lower profits, 
GM has changed its policy: individual retirees are now expected to assume a 
greater share of medical costs, resulting in a decrease in their net disposable 
income. The individual cannot anticipate such uncertainty, especially if the 
firm is in a waning industrial branch. 

Studies about the meaning of early exit might shed light on cultural 
differences among individuals. Most Germans believe they have a moral right 
to retire early after a lifetime of work; the social obligation to make room for 
the young reinforces this “moral economy” perspective (Kohli 1993). British 
workers who have exited early tend to see themselves in an intermediary status 
"too old to work and too young to retire." In this limbo between work and 
nonwork, they are neither unemployed nor retired (Casey & Laczko 1989: 
523). Instead of a “moral economy,” there is a “moral obligation” in Japan, 
where older employees tend to feel they should continue working to fill job 
vacancies. The effectiveness of this depends on whether the old can substitute 
for the young in industries where labor is scarce (Ito 1992: Ch. 8). 

Although comparative studies suggest a wide variety of country-specific 
responses, many analysts see clear evidence of a shift: “from forced exit from 
the labour market to anticipated retirement, taken earlier and earlier” (Conrad 
1991: 200). 

But this review of subjective reactions suggests that this optimism is 
premature. Variations among countries—and probably within countries too— 
may come out of the tension between the powerful labor market forces 
generating early exit and the state’s equally strong “can’t afford” response. 
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Torn between a labor market propelling early exit and a state retarding the 
process, “anticipated retirement” so often leads to a feeling of being in limbo. 

This discussion of the future has emphasized national patterns and their 
dependence on public and private institutions. Attention has been drawn to 
the conflicting perspectives that emerge when early exit 1s analyzed in terms 
of the labor market, the state, and the individual. Understanding the future 
of early exit depends on embracing a broad approach of this sort. Clearly, 
changes during the past 15 years have affected the transition from employment 
to retirement, and modified the conception of retirement as an institution 
founded on solidarity between generations. Theorists as well as decision-mak- 
ers must take these changes into account. 
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